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Novel Human £ 2 Integrin Alpha Subunit 

This application is a continuation-in-part of U.S. Application Serial 
No. 08/286,889, filed August 5, 1994, which is pending, which in turn is a 
continuation-in-part of U.S. Application Serial No. 08/173,497, filed December 
5 23, 1993, which is pending. 

Field of the Invention 

The present invention relates to the cloning and expression of 
polynucleotides encoding a novel human (3 2 integrin a subunit, designated a d , 
which is structurally related to the known human j3 2 integrin a subunits, CDlla, 
10 CD1 lb and CD1 lc. The present invention also relates to polynucleotides isolated 
from other species which show homology to human a d encoding sequences. 

Backgroun d of the Invention 

The integrins are a class of membrane-associated molecules which 
actively participate in cellular adhesion. Integrins are transmembrane 

15 heterodimers comprising an at subunit in noncovalent association with a £ subunit. 
To date, at least fourteen a subunits and eight 0 subunits have been identified 
[reviewed in Springer, Nature 346:425-434 (1990)]. The 0 subunits are generally 
capable of association with more than one a subunit and the heterodimers sharing 
a common 0 subunit have been classified as subfamilies within the integrin 

20 population. 

One class of human integrins, restricted to expression in white 
blood cells, is characterized by a common 0 2 subunit. As a result of this cell- 
specific expression, these integrins are commonly referred to as the leukocyte 
integrins, Leu-CAMs or leukointegrins. Because of the common 0 2 subunit, an 
25 alternative designation of this class is the £ 2 integrins. The /3 2 subunit (CD 18) 
has previously been isolated in association with one of three distinct a subunits, 
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CDlla, CDllb or CDllc. The isolation of a cDNA encoding human CD18 is 
described in Kishimoto, et aL, Cell 48:681-690 (1987). In official WHO * 
nomenclature, the heterodimeric proteins are referred to as CDlla/CD18, 
CDllb/CD18, and CDllc/CD18; in common nomenclature they are referred to 
5 as LFA-1, Mac-1 or Mol and p 150,95 or LeuM5, respectively [Cobbold, et aL, 
in Leukocyte Typing III, McMichael (ed), Oxford Press, p.788 (1987)]. The 
human ($ 2 integrin a subunits CDlla, CDllb and CDllc have been demonstrated 
to migrate under reducing condition in electrophoresis with apparent molecular 
weights of approximately 180 kD, 155 kD and 150 kD, respectively, and DNAs 

10 encoding these subunits have been cloned [CDlla, Larson, et aL, J. Cell Biol 
708:703-712 (1989); CDllb, Corbi, etal.,J.BioLChem, 263:12403-12411 (1988) 
and CD1 lc, Corbi, et aL EMBO /. 5:4023-4028 (1987)]. Putative homologs of 
the human, jS 2 integrin a and ; i8 chains, defined by approximate similarity in 
molecular weight, have been variously identified in other species including 

15 monkeys and other primates [Letvin, et aL, Blood 67:408-410 (1983)], mice 
[Sanchez-Madrid, et aL, J. Exp. Med. 154: 1517 (1981)], and dogs [Moore, et aL , 
Tissue Antigens 36:211-220 (1990)]. 

The absolute molecular weights of presumed homologs from other 
species have been shown to vary significantly [see, e.g., Danilenko et aL, Tissue 

20 Antigens 40: 13-21 (1992)], and in the absence of sequence information, a 
definitive correlation between human integrin subunits and those identified in 
other species has not been possible. Moreover, variation in the number of 
members in a protein family has been observed between different species. 
Consider, for example, that more IgA isotypes have been isolated in rabbits than 

25 in humans [Burnett, et aL, EMBO J. 3:4041-4047 (1989) and Schneiderman, et 

aL, Proc.Natl.Acad.ScL (USA) 86:7561-7565 (1989)]. Similarly, in humans, at ) 
least six variants of the metallothionine protein have been previously identified 
[Karin and Richards, Nature 299: 797-802 (1982) and Varshney, et al, 
Mol. CelLBiol 6:26-37, (1986)], whereas in the mouse, only two such variants are 
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in evidence [Searle, et al, Mol.Cell.Biol. 4: 1221-1230 (1984)]. Therefore, 
existence of multiple members of a protein family in one species does not 
necessarily imply that corresponding family members exist in another species. 

In the specific context of /S 2 integrins, in dogs it has been observed 
5 that the presumed canine £ 2 counterpart to the human CD 18 is capable of dimer 
formation with as many as four potentially distinct a subunits [Danilenko, et al, 
supra]. Antibodies generated by immunizing mice with canine splenocytes 
resulted in monoclonal antibodies which immunoprecipitated proteins tentatively 
designated as caninehomologs to human CD18.CD1 la, CDllband CDllc based 
10 mainly on similar, but hot identical, molecular weights. Another anti-canine 
splenocyte antibody, Call.8H2, recognized and immunoprecipitated a fourth a- 
like canine subunit also capable of association with the 0 2 subunit, but having a 
unique molecular weight arid restricted in expression to a subset of differentiated 
tissue macrophages. Antibodies generated by immunization of hamsters with 
» murine dendritic cells resulted in two anti-integrin antibodies [Metlay, et al, 
J.Exp.Med. 777:1753-1771 (1990)]. One antibody, 2E6, immunoprecipitated a 
predominant heterodimer with subunits having approximate molecular weights of 
180 kD and 90 kD in addition to minor bands in the molecular weight range of 
150-160 kD. .The second antibody, N418, precipitated another apparent 
heterodimer with subunits having approximate molecular weights of 150 kD and 
90 Kd. Based oh cellular adhesion blocking studies, it was hypothesized that 
antibody 2E6 recognized a murine counterpart to human CD18. While the 
molecular weight of theN418 antigen suggested recognition of a murine homolog 
to human CDllc/CD18, further analysis indicated that the murine antigen 
exhibited a tissue distribution pattern which was inconsistent with that observed 
for human CD11C/CD18. 

The antigens recognized by ihe canine Cal 1 . 8H2 antibody and the 
murine N418 antibody could represent a variant species {e.g., a glycosylation or 
splice variant) of a previously identified canine or murine a subunit. 
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Alternatively, these antigens may represent unique canine and murine integrin a 
subunits. In the absence of specific information regarding primary structure, 
these alternatives cannot be distinguished. 

In humans, CD 1 la/CD 18 is expressed on all leukocytes. 
5 CD lib/CD 18 and CDllc/CD18 are essentially restricted to expression on 
monocytes, granulocytes, macrophages and natural killer (NK) cells, but 
CDllc/CDIS is also detected on some B-cell types. In general, CDlla/CD18 
predominates on lymphocytes, CDllb/CD18 on granulocytes and CDllc/CD18 
on macrophages [see review, Arnaout, Blood 75:1037-1050 (1990)]. Expression 
10 of the a chains, however, is variable with regard to the state of activation and 
differentiation of the individual cell types [See review, Larson and Springer, 
- Immunol.Rev. 114:181-217(1990).] 

The involvement of the 0 2 integrins in human immune and 
inflammatory responses has been demonstrated using monoclonal antibodies which 
15 are capable of blocking integrin-associated cell adhesion, for example, 
CDlla/CD18, CDllb/CDI8 and CDllc/CD18 actively participate in natural 
killer (NK) cell binding to lymphoma and adenocarcinoma cells [Patarroyo, et al , 
ImmunoLRev. 1 14:67-108 (1990)], granulocyte accumulation [Nourshargh, et al , 
JJmmunoL 742:3193-3198 (1989)3, granuiocyte-independent plasma leakage 
10 [Arfors, et al, Blood d#;338-340 (1987)], chemotactic response of stimulated 
leukocytes [Arfors, et al, supra] and leukocyte adhesion to vascular endothelium 
[Price, etal t JJmmunoL iJP:4174-4177 (1987) and Smith, et al t J.ClinJnvesL 
&J.2008-2017 (1989)], The fundamental role of 0 2 integrins in immune and 
inflammatory responses is made apparent in the clinical syndrome referred to as 
!5 leukocyte adhesion deficiency (LAD), wherein clinical manifestations include 
recurrent and often life threatening bacterial infections. LAD results from 
heterogeneous mutations in the 0 2 subunit [Kishimoto, et al, Cell 50: 193-202 
(1987)] and the severity of the disease state is proportional to the degree of the 
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deficiency in j8 2 subunit expression. Formation of the complete integrin 
heterodimer is impaired by the £ 2 mutation [Kishimoto, et al t supra]. 

Interestingly, at least one antibody specific for CD 18 has been 
shown to inhibit human immunodeficiency virus type-1 (HTV-1) syncytia 
5 . formation in vitro, albeit the exact mechanism of this inhibition is unclear 
[Hildreth and Orentas, Science 244:1075-1078 (1989)]. This observation is 
consistent with the discovery that a principal counterreceptor of CDlla/CD18, 
ICAM-1, is also a surface receptor for the major group of rhinovirus serotypes 
[Gicvt, et aL, Cell 56;&39 (19S9)]. 

10 The significance of j8 2 integrin binding activity in human immune 

and inflammatory responses underscores the necessity to develop a more complete 
understanding of this class of surface proteins. Identification of yet unknown 
members of this subfamily, as well as their counterreceptors, and the generation 
of monoclonal antibodies or other soluble factors which can alter biological 

15 activity of the £ 2 jntegrins will provide practical means for therapeutic 
intervention in. /S 2 integrin r related immune and inflammatory responses. 

Brief Descri ption of the Invention 
In one aspect, the present invention provides novel purified and 
isolated polynucleotides (e^,, DNA and RNA transcripts, both sense and anti- 

20 sense strands) encoding a novel human £ 2 integrin a subunit, a d , and variants 
thereof (i.e., deletion, addition or substitution analogs) which possess binding 
and/or immunological properties inherent to a d „ Preferred DNA molecules of the 
invention include cDNA, genomic DNA and wholly or partially chemically 
synthesized DNA molecules. A presently preferred polynucleotide is the DNA 

25 as set forth in SEQ ID NO: 1, encoding the polypeptide of SEQ ID NO: 2. Also 
provided are. recombinant plasmid and viral DNA constructions (expression 
constructs) which include a d encoding sequences, wherein the a d encoding 
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sequence is operatively linked to a homologous or heterologous transcriptional 
regulatory element or elements. 

Also provided by the present invention are isolated and purified 
mouse and rat polynucleotides which exhibit homology to polynucleotides 
encoding human a d . A preferred mouse polynucleotide is set forth in SEQ ID 
NO: 52; a preferred rat polynucleotide is set forth in SEQ ID NO: 54. 

As another aspect of the invention, prokaryotic or eukaryotic host 
cells transformed or transfected with DNA sequences of the invention are 
provided which express a d polypeptide or variants thereof. Host cells of the 
invention are particularly useful for large scale production of a d polypeptide, 
which can be isolated from either the host cell itself or from the medium in which 
the host cell is grown. Host cells which express a A polypeptide on their 
extracellular membrane surface are also useful as immunogens in the production 
of or d -specific antibodies. Preferably, host cells transfected with a d will be co- 
15 transfected to express a /3 2 integrin subunit in order to allow surface expression 
of the heterodimer. 

Also provided by the present invention are purified and isolated a d 
polypeptides, fragments and variants thereof. Preferred a d polypeptides are as 
set forth in SEQ ID NO: 2. Novel a d products of the invention may be obtained 
as isolates from natural sources, but, along with « d variant products, are 
preferably produced by recombinant procedures involving host cells of the 
invention. Completely glycosylated, partially glycosylated and wholly de- 
glycosylated forms of the a d polypeptide may be generated by varying the host 
cell selected for recombinant production and/or post-isolation processing. Variant 
<*d polypeptides of the invention may comprise water soluble and insoluble a A 
polypeptides including analogs wherein one or more of the amino acids are 
deleted or replaced: (1) without loss, and preferably with enhancement, of one or 
more biological activities or immunological characteristics specific for a d ; or (2) 
with specific disablement of a particular ligand/receptor binding or signalling 



20 
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function. Fusion polypeptides are also provided, wherein a d amino acid 
sequences are expressed contiguously with amino acid sequences from other 
polypeptides. Such fusion polypeptides may possess modified biological, 
biochemical, and/or immunological properties in comparison to wild-type a d . 
5 Analog polypeptides including additional amino acid (e.g., lysine or cysteine) 
residues that facilitate multimer formation are contemplated. 

Also comprehended by the present invention are polypeptides and 
other non-peptide molecules which specifically bind to a d . Preferred binding 
molecules include antibodies (e.g., monoclonal and polyclonal antibodies), 
0 counterreceptors (e.g. , membrane-associated and soluble forms) and other ligands 
(e. g., naturally occurring- or synthetic molecules), including those which 
competitively bind a d in the presence of a d monoclonal antibodies and/or specific 
counterreceptors: Binding molecules are useful for purification of a d polypeptides 
and identifying cell typfes which express or d . Binding molecules are also useful for 
5 modulating (i.e., inhibiting; blocking or stimulating) of in vivo binding and/or 
signal transduction activities of a d . 

Assays to identify a d binding molecules are also provided, 
including immobilized ligand binding assays, solution binding assays, scintillation 
proximity assays, di-hybrid screening assays, and the like. 
} In vitro Assays for identifying antibodies or other compounds that 

modulate the activity of a d may involve, for example, immobilizing a d or a 
natural ligand to which of d binds, detectaibly labelling the nonimmobilized binding 
partner, incubating the binding partners together and determining the effect of a 
test compound on the amount of label bound wherein a reduction in the label 
> bound in the presence of the test compound compared to the amount of label 
bound in the absence of the test compound indicates that the test agent is an 
inhibitor of a d binding. - 

Another type of assay for identifying compounds that modulate the 
interaction between a d and a ligand involves immobilizing a d or a fragment 
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thereof on a solid support coated (or impregnated with) a fluorescent agent, 
labelling the ligand with a compound capable of exciting the fluorescent agent, 
contacting the immobilized a d with the labelled ligand in the presence and absence 
of a putative modulator compound, detecting light emission by the fluorescent 
5 agent, and identifying modulating compounds as those compounds that affect the 
emission of light by the fluorescent agent in comparison to the emission of light 
by the fluorescent agent in the absence of a modulating compound. Alternatively, 
the o. d ligand may be immobilized and a d may be labelled in the assay. 

Yet another method contemplated by the invention for identifying 
10 compounds that modulate the interaction between a d and a ligand involves 
transforming or transfecdng appropriate host cells with a DNA construct 
comprising a reporter gene under the control of a promoter regulated by a 
transcription factor having a DNA-binding domain and an activating domain, 
expressing in. the host cells a first hybrid DNA sequence encoding a first fusion 
15 of part or all of a d and either the DNA binding domain or the activating domain 
of the transcription factor, expressing in the host cells a second hybrid DNA 
sequence encoding part or all of the ligand and the DNA binding domain or 
activating, domain of the transcription, factor which is not incorporated in the first 
fusion, evaluating the effect of a putative modulating compound on the interaction 
20 between a d and the ligand by detecting binding of the ligand to a d in a particular 
, , host cell by measuring the production of reporter gene product in the host cell in 
the presence or absence of the putative modulator, and identifying modulating 
. compounds as those compounds altering production of the reported gene product 
in comparison to production of the reporter gene product in the absence of the 
25 modulating compound. . Presently preferred for use in the assay are the lexA 
promoter, the lexA DNA binding domain, the GAL4 transactivation domain, the 
/flcZ.reporter gene, and a yeast host cell. 

A modified version of the foregoing assay may be used in isolating 
a polynucleotide encoding a protein that binds to « d by transforming or 
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transfecting appropriate host cells with a DNA construct comprising a reporter 
gene under the control of a promoter regulated by a transcription factor having 
a DNA-binding domain and an activating domain, expressing in the host cells a 
first hybrid DNA sequence encoding a first fusion of part or all of a d and either 
5 the DNA binding domain or the activating domain of the transcription factor, 
expressing in the host cells a library of second hybrid DNA sequences encoding 
second fusions of part or all of putative a d binding proteins and the DNA binding 
domain or activating domain of the transcription factor which is not incorporated 
in the first fusion, detecting binding of an a d binding protein to a d in a particular 

10 host cell by detecting the production of reporter gene product in the host cell, and 
isolating second hybrid DNA sequences encoding a d binding protein from the 
particular host cell. 

Hybridomfc cell lines which produce antibodies specific for a d are 
also comprehended by the invention. Techniques for producing hybridomas which 

15 secrete monbclonal antibodies are well known in the art. Hybridoma cell lines 
may be generated after immunizing an animal with purified a d , variants of a d or 
cells which express a d or a variant thereof on the extracellular membrane surface. 
Immunbgen cell types include cells which express ot d in vivo, or transfected 
prokaryotic or eukaryotic cell lines which normally do not normally express a d 

20 in vivo. ■ .■ v ■ 

The value of the information contributed through the disclosure of 
the DNA and amino acid sequences of a d is manifest. In one series of examples, 
the disclosed <x d CDNA sequence makes possible the isolation of the human oc d 
genomic DNA sequence, including transcriptional control elements for the 

25 genomic sequence. Identification of cc d allelic variants and heterologous species 
(e.g., rat or mouse) DNAs is also comprehended. Isolation of the human a d 
genomic DNA and heterologous species DNAs can be accomplished by standard 
DNA/DNA hybridization techniques, under appropriately stringent conditions, 
using all or part of the ot d cDNA sequence as a probe to screen an appropriate 
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library. Alternatively, polymerase chain reaction (PCR) using oligonucleotide 
primers that are designed based on the known cDNA sequence can be used to 
amplify and identify genomic a d DNA sequences. Synthetic DNAs encoding the 
a d polypeptide, including fragments and other variants thereof, may be produced 
5 by conventional synthesis methods. 

DNA sequence information of the invention also makes possible the 
development, by homologous recombination or "knockout" strategies [see, e.g., 
Kxpccchi, Science 244:12884292 (1989)], to produce rodents that fail to express 
a functional a d polypeptide or that express a variant a d polypeptide. Such rodents 
10 are useful as models for studying the activities of a d and a d modulators in vivo. 

DNA and amino acid sequences of the invention also make possible 
the analysis of a d epitopes which actively participate in counterreceptor binding 
as well as epitopes which may regulate, rather than actively participate in, 
binding . Identi H cation of epitopes which may participate in transmembrane signal 
15 transduction is also comprehended by the invention. 

DNA of the invention is also useful for the detection of cell types 
which express a d polypeptide. Standard DNA/RNA hybridization techniques 
which utilize or^ DNA to detect c/ d RNA may be used to determine the 
constitutive level of a d transcription within a cell, as well as changes in the level 
20 . of transcription in response to internal or external agents. Identification of agents 
;j which modify transcription and/or translation of a d can, in turn, be assessed for 
potential therapeutic or prophylactic value. DNA of the invention also makes 
* possible in situ hybridisation of a d DNA to cellular RNA to determine the cellular 
localization of a d specific messages within complex cell populations and tissues. 
25 DNA of the invention is also useful for identification of non-human 

polynucleotide sequences ,which display homology to human a d sequences. 
Possession of non-human. a d DNA. sequences permits development of animal 
models (including, for example, transgenic models) of the human system. 
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As another aspect of the invention, monoclonal or polyclonal 
antibodies specific for a d may be employed in immunohistochemical analysis to 
localize a& to subcellular compartments or individual cells within tissues. 
Immunohistochemical analyses of this type are particularly useful when used in 
5 combination with in situ hybridization to localize both ct d mRNA and polypeptide 
products of the or d gene. 

Identification of cell types which express a d may have significant 
ramifications for development of therapeutic and prophylactic agents. It is 
anticipated that the products of the invention related to a d can be employed in the 

10 treatment of diseases wherein macrophages are an essential element of the disease 
process. Animal models for many pathological conditions associated with 
macrophage activity have been described in the art. For example, in mice, 
macrophage recruitment ? to sites of both chronic and acute inflammation is 
reported by Juulz, et al., J.Leukocyt€ Biol 54:30-39 (1993). In rats, Adams, et 

15 aL t [Tramplamaiion53dli5AU9(l9^ Transplantation 56:794-799 (1993)] 
describe a model for graft arteriosclerosis following heterotropic abdominal 
cardiac allograft transplantation. Rosenfeld, et ah , [Arteriosclerosis 7:9-23 (1987) 
and Arteriosclerosis 7:24%34 (1987)] describe induced atherosclerosis in rabbits 
fed a cholesterol supplemented diet. Hanenberg, et al, [Diabetologia 32: 126-134 

20 (1989)] report the spontaneous development of insulin-dependent diabetes in BB 
rats. Yamada et al. , [Gastroenterology 104:159-111 (1993)] describe an induced 
inflammatory bowel disease, chronic granulomatous colitis, in rats following 
injections of streptococcal peptidoglycan-polysaccharide polymers. Cromartie, et 
al, [J.ExpMed. 146: 1585-1602 (1977)] and Schwab, et al, [Infection and 

25 - Immunity 5P:4436-4442 (1991)] report that injection of streptococcal cell wall 
protein into rats results in an arthritic condition characterized by inflammation of 
peripheral joints and subsequent joint destruction. Finally, Huitinga, et al., 
[Eur J. Immunol 23:709-715 (1993) describe experimental allergic 
encephalomyelitis, a model for multiple sclerosis, in Lewis rats. In each of these 
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models, a d antibodies, other a d binding proteins, or soluble forms of a d are 
utilized to attenuate the disease state, presumably through inactivation of 
macrophage activity. 

Pharmaceutical compositions for treatment of these and other 
5 disease states are provided by the invention. Pharmaceutical compositions are 
designed for the purpose of inhibiting interaction between a d and its ligand(s) and 
include various soluble and membrane-associated forms of a d (comprising the 
entire a d polypeptide, or fragments thereof which actively participate in a d 
binding), soluble and membrane-associated forms of a d binding proteins 
10 (including antibodies^ ligands, and the like), intracellular or extracellular 
modulators of aj binding activity, and/or modulators of a d and/or a d -ligand 
polypeptide expression, including modulators of transcription, translation, post- 
translatipnal processing and/or intracellular transport. The invention also 
comprehends methods for treatment of disease states in which a d binding is 
15 implicated, wherein a patient. suffering from said disease state is provided an 
. amount of a pharmaceutical composition of the invention sufficient to modulate 
levels of a d binding. The method of treatment of the invention is applicable to 
disease states such as, but not limited to, Type I diabetes, atherosclerosis, 
: , multiple sclerosis, asthma, psoriasis, and rheumatoid arthritis. 

20 Brief De scription of the Drawing 

Numerous other aspects and advantages of the present invention 
will be apparent upon consideration of the following description thereof, reference 
being made to the drawing wherein: 

Figure 1 A through ID comprises an alignment of the human amino 
25 acid sequences of CD1 lb (SEQ ID NO: 3), CD1 1c (SEQ ID NO: 4) and a d (SEQ 
ID NO: 2). • < 
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Detailed Description of the Invention 
The present invention is illustrated by the following examples 
relating to the isolation of a cDNA clone encoding a d from a human spleen 
cDNA library. More particularly, Example 1 illustrates the use of anti-canine 
5 a TtA\ anybody in an attempt to detect a homologous human protein. Example 2 
details purification of canine <xjmi N-tenninal sequencing of the polypeptide 
to design oligonucleotide primers for PCR amplification of the canine ctjMi g ene - 
Example 3 addresses large scale purification of canine <xjmi for internal 
sequencing in order to design additional PCR primers. Example 4 describes use 

10 of the PCR and internal sequence primers to amplify a fragment of the canine 
a-j^j gene. Example 5 addresses cloning of the human a d -encoding cDNA 
sequence. Example- 6- describes Northern blot hybridization analysis of human 
tissues and cells for expression of a d mRNA. Example 7 details the construction 
of human a d expression plasmids and transfection of COS cells with the resulting 

IS plasmids. Example 8 addresses ELIS A analysis of a d expression in transfected 
COS cells. Example 9 describes FACS analysis of COS cells transfected with 
human c* d expression plasmids. Example 10 addresses immunoprecipitation of 
CD18 in association with a d in co-transfected COS cells. Example 11 relates to 
stable transfeition of a d expression constructs in Chinese hamster ovary cells. 

20 Example 12 addresses CD 18 -dependent binding of a d to the intercellular adhesion 
molecule, ICAM-R. Example 13 describes scintillation proximity screening 
assays to identify- inhibitor of a d ligand/anti-ligand binding interactions. 
Example 14 addresses construction of expression plasmids which encode soluble 
forms of a d . Example < 15 relates to production of a d -specific monoclonal 

25 antibodies. Example 16 describes analysis of t* d tissue distribution using 
polyclonal antiserum. Btample 17 describes isolation of rat cDNA sequences 
which show homology to human a d gene sequences. Example 18 relates to 
construction of rat a d I domain expression plasmids, including I domain/IgG 
fusion proteins, and production of monoclonal antibodies to I domain fusion 
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protcins. Example 19 addresses isolation of mouse cDNA sequences which show 
homology to human a d gene sequences. Example 20 describes isolation of 
additional mouse a d cDNA clones used for conformational sequence analysis. 
Example 21 relates to in situ hybridbation analysis of various mouse tissues to 
5 determine tissue and cell specific expression of the putative mouse homolog to 
human a d . Example 22 describes generation of expression constructs which 
encode the putative mouse homolog of human a d . Example 23 addresses design 
of a "knock-out" mouse wherein the gene encoding the putative mouse homolog 
of human ot d is disrupted. Example 24 describes isolation of rabbit cDNA clones 
10 which show homology to humari <x d encoding sequences. Example 25 describes 
animal models which xeseroble human disease states wherein modulation of a d is 
assayed for therapeutic capabilities. 

Example 1 

Attempt to Detect a Human Homolop of Canine Qf- fMl 

15 The monoclonal antibody Call. 8H2 [Moore, et al . , supra] specific 

for canine ajMi was tested for cross-reactivity on human peripheral blood 
: leukocytes in an attempt to identify a human homolog of canine Cell 
preparations (typically 1 x 10 6 cells) were incubated with undiluted hybridoma 
supernatant or a purified mouse IgG-negative control antibody (10 pig/ml) on ice 

20 in the presence of 0. 1 % sodium azide. Monoclonal antibody binding was detected 
by subsequent incubation with FTTC-conjugated horse anti-mouse IgG (Vector 
Laboratories, Burlingame, CA) at 6 fig/mL Stained cells were fixed with 2% w/v 
paraformaldehyde in phosphate buffered saline (PBS) and were analyzed with a 
Facstar Plus fluorescence-activated cell sorter (Becton Dickinson, Mountain View, 

25 CA). Typically, 10,000 cells were analyzed using logarithmic amplification for 
fluorescence intensity. 

The results indicated that Ca 1 1 . 8H2 did not cross-react with surface 
proteins expressed on human peripheral blood leukocytes, while the control cells, 



BNSOOCID: <WO_9517412Al_l_> 



WO 95/17412 



PCT/OS94/14832 



- 15 - 

neoplastic canine peripheral blood lymphocytes, were essentially all positive for 
°TM1- 

Because the monoclonal antibody Cal L8H2 specific for the canine 
a subunit did not cross react with a human homolog, isolation of canine <*tmi 
5 DNA was deemed a necessary prerequisite to isolate a counterpart human gene 
if one existed. 

Example 2 

Affinity Pvrifipation Qf Conine a TM1 For N-Tf*minal Sequencing 

Canine ctjwi was affinity purified in order to determine N-terminal 

10 amino acid sequences for oligonucleotide probe/primer design. Briefly, anti-ay^ 
monoclonal antibody Gal L8H2 was coupled to Affigel 10 chromatographic resin 
(BioRad, Hercules, CA) and protein was isolated by specific antibody-protein 
interaction. Antibody was conjugated to the resin, according to the BioRad 
suggested protocol, at a concentration of approximately 5 mg antibody per ml of 

15 resin. Following the conjugation reaction, excess antibody was removed and the 
resin blocked with three volumes of 0.1 M ethanolamine. The resin was then 
washed with thirty, column volumes of phosphate buffered saline (PBS). 

Twenty-five grams of a single dog spleen were homogenized in 250 
ml of buffer containing 0.32 M sucrose in 25 mM Tris-HCl, Ph 8.0, with 

20 protease inhibitors. Nuclei and cellular debris were pelleted with centrifugation 
at 1000 g for 15 minutes. Membranes were pelleted from the supernatant with 
centrifugation at 100,000 g for 30 minutes. The membrane pellet was 
resuspended in 200 ml lysis buffer (50 mM NaCl, 50 mM borate, pH 8.0, with 
2% NP-40) and incubated for 1 hour on ice. Insoluble material was then pelleted 

25 by centrifugation at 100,000 g for 60 minutes. Ten milliliters of the cleared 
lysate were transferred to a 15 ml polypropylene tube with 0.5 ml Call.8H2- 
conjugated Affigel 10 resin described above. The tube was incubated overnight 
at 4 * C with rotation and the resin subsequently washed with 50 column volumes 
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D-PBS. The resin was then transferred to a microfuge tube and boiled for ten 
minutes in 1 ml Laemmli (non-reducing) sample buffer containing 0. 1 M Tris- 
HC1, pH 6.8, 2% SDS, 20% glycerol and 0.002% bromophenol blue. The resin 
was pelleted by centrifugation and discarded; the supernatant was treated with 
5 1/15 volume j8-mercaptoethanol (Sigma, St. Louis, MO) and run on a 7% 
polyacrylamide gel . The separated proteins were transferred to Immobilon PVDF 
membrane (Millipore, Bedford, MA) as follows. 

The gels were- washed once in deionized, Millipore-filtered water 
and equilibrated for 15-45 minutes in 10 mM 3-[cyclohexylamino]-l- 

10 propanesulfonic acid (CAPS) transfer buffer, pH 10.5, with 10% methanol. 
Immobilon membranes were moistened with methanol, rinsed with filtered water, 
and equilibrated for 15-30 minutes in CAPS transfer buffer. The initial transfer 
was carried out using a Biorad transfer apparatus at 70 volts for 3 hours. The 
Immobilon membrane was removed after transfer and stained in filtered 0.1% 

15 R250 Coomassie stain for, 10 minutes. Membranes were destained in 50% 
methanol/ 10% acetic acid three times, ten minutes each time. After destaining, 
the membranes were washed in filtered water and air-dried. 

Protein bands of approximately ,150 kD, 95 kD, 50 kD and 30 kD 
were detected. Presumably the 50 kD and 30 kD bands resulted from antibody 

20 contamination. N-terminal sequencing was then attempted on both the 150 kD 
and 95 kD bands, but the 95 kD protein was blocked, preventing sequencing. 
The protein band of 150 kD was excised from the membrane and directly 
sequenced with an Applied Biosystems (Foster City, CA) Model 473A protein 
sequencer according to the manufacturer's instructions. The resulting amino acid 

25 sequence is set in SEQ ID NO: 5 using single letter amino acid designations. 

FNLDVEEPMVFQ (SEQ ID NO: 5) 
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Tne identified sequence included the FNLD sequence characteristic of a subunits 
of the integrin family [Tamura, et aL, J.Cell.Biol. 777:1593-1604 (1990)]. 

Primer Design and Attempt to Amplify Canine a -pii! Sequences 

From die N-terminal sequence information, three oligonucleotide 
5 probes were designed for hybridization: a) Tommer," a fully degenerate 
oligonucleotide; b) Tatmer," a partially degenerate oligonucleotide; and c) 
"Guessmer," a nondegenerate oligonucleotide based on mammalian codon usage. 
These probes are set out below as SEQ ID NOS: 6, 7 and 8, respectively. 
Nucleic acid symbols are in accordance with 37 C.F.R. §1.882 for these and all 
10 other nucleotide sequences herein. 

5 '-TTYAA YYTGGAYGlNGARGARCCNATGGTOTTYCA-a^SEQ ID NO: 6) 
5 # -TTCAACGTGGACGTGGAGGAGCCCATGGTGTTCCAA<SEQ ID NO: 7) 
5 '-TTCAACCTGGACGTNGAASANCCCATGGTCTTCCAA^BEQ ID NO: 8) 

Based on sequencing data, no relevant clones were detected using 
15 these oligonucleotides in several low stringency hybridizations to a canine 
spleen/peripheral blood macrophage cDNA library cloned into XZAP (Stratagene, 
La Jolla, CA). " 

Four other oligonucleotide primers, designated 5 'Deg, 5 'Spec, 
3 'Deg and 3 'Spec (as set out in SEQ ID NOS: 9, 10, 11 and 12, respectively, 
20 wherein : Deg indicates degenerate and Spec indicates non-degenerate) were 
subsequently designed based on the deduced N-terminal sequence for attempts to 
amplify canine a T ^ 1 sequences by PCR from phage library DNA purified from 
plate lysates of the Stratagene library described above. 

5 '-TTYAA Y YTNGA YGTNG ARGARCC-3 ' (SEQ ID NO: 9) 

25 5 '-TTYAAYYTGGACGTNGAAGA-3 ' (SEQ ID NO: 10) 
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5 '-TGRAANACC ATNGGYTC-3 ' (SEQ ID NO: 1 1) 

5 '-TTGGAAGACCATNGGYTC-3 ' (SEQ ID NO: 12) 

The ctjMi oligonucleotide primers were paired with T3 or T7 
vector primers, as set out in SEQ ID NOS: 13 and 14, respectively, which 
5 hybridize to sequences flanking the polylinker region in the Bluescript phagemid 
found in XZAP. 

5 '"ATTAACCCTCACTAAAG-3 ' (SEQ ID NO: 13) 

5 '-AATACGACTCACTATAG-3 ' (SEQ ID NO: 14) 

The PCR amplification was carried out in Tag buffer (Boehringer Mannheim, 

10 Indianapolis, IN) containing magnesium with 150 ng of library DNA, 1 /xg of 
each primer, 200 /sM dNTPs and 2.5 units Taq polymerase (Boehringer 
Mannheim) and the products were separated by electrophoresis on a 1 % agarose 
gel in Tris- Acetate EDTA (TAB) buffer with 0.25 /ig/ml ethidium bromide. DNA 
was transferred to a Hybond (Amersham, Arlington Heights, IL) membrane by 

15 wicking overnight in 10X SSPE. After transfer, the immobilized DNA was 
denatured with 0.5 M NaOH with 0.6 M NaCl, neutralized with 1.0 M Tris-HCl, 
pH 8.0, in 1.5 M NaCl, and washed with 2X SSPE before UV crosslinking with 
. a StrataJinker (Stratagene) crosslinking apparatus. The membrane was incubated 
in pfehybridization buffer (5X SSPE, 4X Denhardts, 0.8% SDS, 30% formamide) 

20 for 2 hr at 50 *C with agitation. 

Oligonucleotide probes 5'Deg, 5 'Spec, 3'Deg and 3 'Spec (SEQ 
ID NOS: 9, 10, 11 and 12, respectively) were labeled using a Boehringer 
Mannheim kinase buffer with 100-300 /*Ci 7P 32 -dATP and 1-3 units of 
polynucleotide kinase for 1-3 hr at 37* C. Unincorporated label was removed with 

25 Sephadex G-25 fine* (Pharmacia, Piscataway, NJ) chromatography using 10 mM 
Tris-HCl, pH 8.0, I mM EDTA (TE) buffer and the flow-through added directly 
to the prehybridization solution. Membranes were probed for 16 hr at 42 X with 



BNSOOCID: <WO_9517412A1 J_> 



U WO 95/17412 PCT/US94/14832 



- 19- 

agitation and washed repeatedly, with a final stringency wash of IX SSPE/0.1 % 
SDS at 50 # for 15 min. The blot was then exposed to Kodak X-Omat AR film 
for 1-4 hours at -8(TC. 

The oligonucleotides 5'Deg, 5 'Spec, 3'Deg and 3 'Spec only 
5 hybridized to PCR products from the reactions in which they were used as 
primers and failed to hybridize as expected to PCR products from the reactions 
in which they were not used as primers. Thus, it was concluded that none of the 
PCR products were specific for cc TM1 because no product hybridized with all of 
the appropriate probes. 

10 ]Ejc^mp^3 

Large Scale Affinity Purification Of Canine For Internal Sequencing 

In order to provide additional amino acid sequence for primer 
design, canine ixfMi was purified for internal sequencing. Three sections of 
frozen spleen (approximately 50 g each) and frozen cells from two partial spleens 

15 from adult dogs were used to generate protein for internal sequencing. Fifty 
grams of spleen were homogenized in 200-300 ml borate buffer with a Waring 
blender. The homogenized material was diluted with 1 volume of buffer 
containing 4 % NP-40; and the mixture then gently agitated for at least one hour. 
The resulting lysate was cleared of large debris by centrifugation at 2000 g for 20 

20 min, and then filtered through either a Coming (Corning, NY) prefilter or a 
Corning 0.8 micron filter; The lysate was further clarified by filtration through 
the Corning 0.4 micron filter system, 

Splenic lysate and the antibody-conjugated Affigel 10 resin 
described in Example 2 were combined at a 150: 1 volume ratio in 100 ml aliquots 

25 and incubated overnight at 4'C with rocking. The lysate was removed after 
centrifugation at 1000 g for 5 minutes, combined with more antibody-conjugated 
Affigel 10 resin and incubated overnight as above. The absorbed resin aliquots 
: were then combined and washed with 50 volumes D-PBS/0. 1 % Tween 20 and the 
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resin transferred to a 50 ml Biorad column. Adsorbed protein was eluted from 
the resin with 3-5 volumes of 0. 1 M glycine (pH 2.5); fractions of approximately 
900 pi were collected and neutralized with 100 fil 1 M Tris buffer, pH 8.0. 
Aliquots of 15 pi were removed from each fraction and boiled in an equal volume 
5 of 2X Laemmli sample buffer with 1/15 volume 1 M dithiothreitol (DTT). These 
samples were electrophoresed on 8 % Novex (San Diego, CA) polyacrylamide gels 
and visualized either by Coomassie siain or by silver stain using a Daiichi kit 
(Enprotech, Natick, MA) according to the manufacturer's suggested protocol. 
Fractions which contained the largest amounts of protein were combined and 
10 concentrated by vacuum. The remaining solution was diluted by 50% with 
reducing Laemmli sample buffer and run or. 1.5 mm 7% polyacrylamide gels in 
Tris-glycine/SDS buffer. Protein was transferred from the gels to Immobilon 
membrane by the procedure described in Example 2 using the Hoefer transfer 
apparatus. 

15 The protein bands corresponding to canine were excised from 

10 PVDF membranes and resulted in approximately 47 fig total protein. The 
bands were destained in 4 ml 50% methanol for 5 minutes, air dried and cut into 
1x2 mm pieces. The membrane pieces were submerged in 2 ml 95% acetone 
at 4*C for 30 minutes with occasional vortexing and then air dried. 

20 Prior to proteolytic cleavage of the membrane bound protein, 3 mg 

of cyanogen bromide (CNBr) (Pierce, Rockford, IL) were dissolved in 1.25 ml 
70% formic acid. This solution was then added to a tube containing the PVDF 
membrane pieces and the tube incubated in the dark at room temperature for 24 
hours. The supernatant (S 1) was then removed to another tube and the membrane 

25 pieces washed with 0.25 ml 70% formic acid. This supernatant (S2) was removed 
and added to the previous supernatant (SI). Two milliliters of Milli Q water were 
added to the combined supernatants (S 1 and S2) and the solution lyophilized. The 
PVDF membrane pieces were dried under nitrogen and extracted again with 1.25 
ml 60% acetonitrile, 0.1% tetrafluoroacetic acid (TFA) at 42 X for 17 hours. 
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This supernatant (S3) was removed and the membrane pieces extracted again with 
1.0 ml 80% acetonitrile with 0.08% TFA at 42* C for 1 hour. This supernatant 
(S4) was combined with the previous supernatants (SI, S2 and S3) and vacuum 
dried. 

5 The dried CNBr fragments were then dissolved in 63 p\ 8 M urea, 

0.4 M NH 4 HC0 3 . The fragments were reduced in 5 fil 45 mM dithiothreitol 
(DTT) and subsequently incubated at 50* C for 15 minutes. The solution was then 
cooled to room temperature and the fragments alkylated by adding 5 ii\ 100 mM 
ipdoapetamide (Sigma, St Louis, MO). Following a 15 minute incubation at 

10 room temperature, the sample was diluted with 187 /tl Milli Q water to a final 
urea concentration of 2.0 M;. Trypsin (Worthington, Freehold, NJ) was then 
added: at a ratio of 1:25 (w:w) of enzyme to protein and the protein digested for 
24 hours at 37* C. Digestion was terminated with addition of 30 fil TFA. 

The protein fragments were then separated with high performance 

15 liquid chromatography (HPLC) on a Waters 625 LC system (Millipore, Milford, 
MA) using a 2.1 x 250 mm, 5 micron Vydac C-18 column (Vydac, Hesperia, 
CA) equilibrated in 0.05% TFA and HPLC water (buffer A). The peptides were 
eluted with increasing concentration of 80% acetonitrile in 0.04% TFA (buffer B) 
with a gradient of 38-75 % buffer B for 65-95 minutes and 75-98% buffer B for 

20 95-105 minutes. Peptides were fractionated at a flow rate of 0.2 ml/minute and 
detected at 210 nm. 

Following fractionation, the amino acid sequence of the peptides 
was analyzed by automated Edman degradation performed on an Applied 
Biosystems Model 437A protein sequencer using the manufacturer's standard 

25 cycles and the Model 610A Data Analysis software program, Version 1.2.1. All 
sequencing reagents were supplied by, Applied Biosystems. The amino acid 
sequences of seven of the eight internal fragments are set out below wherein "X" 
indicates the identity of the amino acid was not certain. 
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5 



VFQEXGAGFGQ 

LYDXVAATGLXQPI 

PLEYXDVIPQAE . 

FQEGFSXVLX 

TSPTFIXMSQENVD 

LWGAPLEVVAVXQTGR 



(SEQ ID NO: 15) 
(SEQ ID NO: 16) 
(SEQ ID NO: 17) 
(SEQ ID NO: 18) 
(SEQ ID NO: 19) 
(SEQ ID NO: 20) 
(SEQ ID NO: 21) 



LDXKPXDTA 



Primer Design 

. One internal amino acid sequence (set out in SEQ ID NO: 22) 
obtained was then used to design a fully degenerate oligonucleotide primer, 
designated p4(R) as .set out in SEQ ID NO: 23. 



Example 4 

.15 PCR Cloning Of A Canine r nu Fragment 

. , : : The ,5 ; portion of the canine gene was amplified from 

. double-stranded canine splenic cDNA by. PCR. 



One gram of frozen material from a juvenile dog spleen was ground 
in liquid nitrogen on dry ice and homogenized: in 20 ml RNA-Stat 60 buffer (Tel- 
Test B, Inc. Friendswood. TX). Four ml chloroform were added, and the 
solution extracted by centrifugation at 12,000 g for 15 minutes. RNA was 
precipitated from the aqueous layer with 10 ml ethanol. Poly A + RNA was then 
selected on Dynal Oligo dT Dynabeads (Dynal, Oslo, Norway). Five aliquots of 
100 fig total RNA were combined and diluted with an equal volume of 2X binding 



FGEQFSE . . ... 

5 '-RAANCCYTCYTGRAAACTYTC-3 ' 



(SEQ ID NO: 22) 
(SEQ ID NO: 23) 



A. Generation of Double Stranded Canine Spleen cDNA 
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buffer (20 mM Tris-HCl, pH 7.5, 1.0 M LiCl, 1 mM EDTA, 0.1 % SDS). RNA 
was then incubated 5 minutes with the Oligo dT Dynabeads (1.0 ml or 5 mg 
beads for all the samples). Beads were washed with buffer containing 10 mM 
Tris-HCl, pH 7.5, 0.15 M LiCl, 1 mM EDTA and 0.1% SDS, according to the 
5 manufacturer's suggested protocol prior to elution of poly A + mRNA with 2 mM 
EDTA, pH 7.5. Double-stranded cDNA was then generated using the eluted poly 
A + mRNA and the Boehringer Mannheim cDNA Synthesis Kit according to the 
manufacturer's suggested protocol. 

■ B. Isolation of a Partial Canine a TMl cDNA 

10 Oligonucleotide primers 5 'Deg (SEQ ID NO: 9) and p4(R) (SEQ 

ID NO: 23) were employed in a standard PCR reaction using 150 ng double- 
stranded cDNA, 500 ng of each primer, 200 ^jM dNTPs and 1.5 units Tag 
polymerase (Boehringer Mannheim) in Taq buffer (Boehringer Mannheim) with 
magnesium. The resulting products (1 pi of the original reaction) were subjected 

15 to a second round of PCR with the same primers to increase product yield. This 
band was eluted fronva 1% agarose gel onto Schleicher & Schuell (Keene, NH) 
NA45 paper in a buffer containing 10 mM Tris-HCl, pH 8, 1 mM EDTA, 1.5 M 
NaCl at .65 *G, precipitated, and ligated into the pCR^II vector (Invitrogen, San 
Diego, CA) using the TA cloning kit (Invitrogen) and the manufacturer's 

20 suggested protocol. The ligation mixture was transformed by electroporation into 
XL-1 Blue bacteria (Stratagene). One clone, 2.7, was determined to contain 
sequences corresponding to peptide sequences which were not utilized in 
design of the primers. 

Sequencing was performed with an Applied Biosystems 373A DNA 

15 sequencer (Foster City, C A) with a Dye-deoxy terminator cycle sequence kit 
(ABI) in which fluorescent-labeled dNTPs were incorporated in an asymmetric 
PCR reaction [McCabe, "Production of Single Stranded DNA by Asymmetric 
?CR, tt in PCR Protocols: A Guide to Methods and Applications . Innis, et al 
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(eds.) pp. 76-83 Academic Press: New York (1990)] as follows. Samples were 
held at 96°C for 4 minutes and subjected to 25 cycles of the step sequence: 96°C, 
for 15 seconds; 50°C for 1 second; 60°C for 4 minutes. Sequence data was 
automatically down-loaded into sample files on the computer that included 
5 chrornatogram and text files. Tne sequence of the entire insert of clone 2.7 is set 
out in SEQ ID NO: 24. 

Attempts to isolate the full length canine <xjmi cDNA from the 
Stralagene library (as described in Example 2) were unsuccessful. Approximately 
1 x 1 0 6 phage plaques were screened by hybridization under low stringency 

10 conditions using 30% formaxnide with clone 2:7 as a probe, but no positive clones 
resulted. Attempts to amplify relevant sequences downstream from those 
. ■ represented in clone 2:7 using specific oligonucleotides derived from clone 2.7 or 
degenerate primers based on amino acid sequence from other peptide fragments 
paired with a degenerate oligonucleotide based on the conserved a subunit amino 

15 acid motif GFFKR [Tamura, et al , supra] were also unsuccessful. 

Example 5 

Cloning Of A Putative Human Homolog Of Canine <x TMl 

To attempt the isolation of a human sequence homologous to canine 
o£ TM1 the approximately. 1 kb canine fragment from clone 2.7 was used as 
20 a probe. The probe was generated by PCR under conditions described in 
Example 2 using NT2 (as set out in SEQ ID NO: 25) and p4(R) (SEQ ID NO: 
■ t • . 23) primers. 

5 '-GTNTTYCARGARGA YGG^3 * (SEQ ID NO: 25) 

The PCR product was purified using the Qiagen (Chatsworth, GA) Quick Spin kit 
25 and the manufacturer's suggested protocol. The purified DNA (200 ng) was 
labeled with 200 jiCi of 32 PdCTP using the Boehringer Mannheim Random Prime 
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Labelling kit and the manufacturer's suggested protocol. Unincorporated isotope 
was removed with Sephadex G25 (fine) gravity chromatography. The probe was 
denatured with 0.2 N NaOH and neutralized with 0.4 M Tris-HCl, pH 8.0, before 
use. 

5 Colony lifts on Hybond filters (Amersham) of a human spleen 

cDNA library in pCDNA/Amp (Invitrogen, San Diego, CA) were prepared. The 
filters were initially denatured and neutralized as described in Example 2 and 
subsequently incubated in a prehybridization solution (8 ml7filter) with 30% 
formamide at 50 "C with gentle agitation for 2 hours. Labeled probe as described 

10 above was added to this solution and incubated with the filters for 14 hours at 
42 *C. The filters were washed twice in 2X SSC/0, 1 % SDS at 37* C and twice 
in 2X SSC/0.1% SDS at 50* C. Final stringency washes were IX SSC/0.1 % SDS, 
twice at 65'C (IX SSC is 150 mM NaCl, 15 mM sodium citrate, pH 7.0). 
Filters were exposed to Kodak X-Omat AR film for six hours with an intensifying 

15 screen. Colonies; giving signals on duplicate lifts were streaked on LB medium 
with magnesium (LBM)/carbenicillin plates and incubated overnight at 37 'C. 
Resulting streaked colonies were lifted with Hybond filters and these filters were 
treated as above. The} filters were hybridized under more stringent conditions 
with the l kb probe from clone 2.7; labeled as previously described, in a 50% 

20 formamide hybridization solution at 50* C for 3 hours. Probed filters were 
washed with a final stringency of 0.1 X SSC/0.1% SDS at 65 *C and exposed to 
Kodak X-Omat AR film for 2.5 hours at -80 *C with an intensifying screen. 
Positive colonies were identified and cultured in . LBM/carbenicillin medium 
overnight. DNA from the cultures was prepared using the Promega Wizard 

25 miniprep kit according to the manufacturer's suggested protocol and the resulting 
DNA was sequenced. 

The initial screening resulted in 18 positive clones, while the 
secondary screening under more stringent hybridization conditions produced one 
positive clone which was designated 19A2. The DNA and deduced amino acid 
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sequences of the human a d clone 19A2 are set out in SEQ ID NOS: 1 and 2, 
respectively. 

Characteristics Of The Human ot d cDNA and Predicted Polypeptide 

Clone 19A2 encompasses the entire coding region for the mature 
5 protein, plus 48 bases (16 amino acid residues) of the 5' upstream signal 
sequence and 241 bases of 3 ' untranslated sequence which do not terminate in a 
polyadenylation sequence. The core molecular weight of the mature protein is 
predicted to be around 125 kD. Tne extracellular domain is predicted to 
encompass approximately amino acid residues 17 through 1108 of SEQ ID NO: 

10 2. This extracellular region is contiguous with about a 20 amino acid region 
homologous to the human CDIlc transmembrane region (residues 1109 through 
1128 of SEQ ID NO: 2). The cytoplasmic domain comprises approximately 30 
amino acids (about residues 1 129 through 1161 of SEQ ID NO: 2). The protein 
also contains a region (around residues 150 through 352) of approximately 202 

15 amino acids homologous to the I (insertion) domain common to CDlla, CDllb 
and CD 11c [Larson and Springer, supra], a E [Shaw, et al, J.Biol Chem. 
259:6016-6025 (1994)] and in VLA1 and VLA-2, [Tamura, et al, supra]. The 
I domain in other integrins has been shown to participate in ICAM binding 
[Landis, ei al, J.CelLBiol. 720:1519-1527 (1993); Diamond, et al, J.CellBiol 

20 720: 103 1-1043 (1993)], suggesting that a d may also bind members of the ICAM 
family of surface molecules. This region has not been demonstrated to exist in 
any other integrin subunits. 

The deduced amino acid sequence of a d shows approximately 36% 
identity to that of CDlla, approximately 60% identity to CDllb and 

25 approximately 66% identity to CDl lc. An alignment of amino acid sequences for 
(CDllb SEQ ID NO: 3), CDIlc (SEQ ID NO: 4) and a d (SEQ ID NO: 2) is 
presented in Figure 1. 
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The cytoplasmic domains of or subunits in j3 2 integrins are typically 
distinct from one another within the same species, while individual a subunits 
show high degrees of homology across species boundaries. Consistent with these 
observations, the cytoplasmic region of a d differs markedly from CD1 la, CD1 lb, 
5 and CD1 1c except for a membrane proximal GFFKR amino acid sequence which 
has been shown to be conserved among all a integrins [Rojiani, et al 9 
Biochemistry 30: 9859-9866 (1991)]. Since the cytoplasmic tail region of integrins 
has been implicated in "inside out" signaling and in avidity regulation [Landis et 
al, supra], it is possible that a d interacts with cytosolic molecules distinct from 

10 those interacting with CDlla, CDllb, and CDllc, and 5 as a result, participates 
in signaling pathways distinct from those involving other fi 2 integrins. 

The extracellular domain of <* d contains a conserved DGSGS amino 
acid sequence adjacent the I-domain; in CDllb, the DGSGS sequence is a 
metal-binding region required for ligand interaction [Michishita, et al. Cell 

15 72:857-867 (1993)]. Three additional putative cation binding sites in CD1 lb and 
CD 11c are conserved in the a d sequence at amino acids 465-474, 518-527, and 
592-600 in clone 19A2 (SEQ ID NO: 1). The a d I-domain is 36%, 62%, and 
57% identical to the corresponding regions in CDlla, CDllb, and CDllc, 
respectively , and the relatively low sequence homology in this region suggests that 

20 a d may interact with a set of extracellular proteins distinct from proteins with 
which other knowp 0 2 integrins interact. Alternatively, the affinity of a d for 
known 0 2 integrin ligands, for example, ICAM-1, ICAM-2 and/or ICAM-R, may 
be distinct from that demonstrated for the other j5 2 integrin/ICAM interactions. 
[See Example 12.] : . 
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Example 6 

Northern Analysis of Human a d Expression in Tissues 

In order to determine the relative level of expression and tissue 
specificity of or d , Northern analysis was performed using fragments from clone 
5 19A2 as probes. Approximately 10 fig of total RNA from each of several human 
tissues or cultured cell lines were loaded on a formaldehyde agarose gel in the 
presence of I pg of ethidium bromide. After electrophoresis at 100 V for 4 hr, 
the RNA was transferred to a nitrocellulose membrane (Schleicher & Schuell) by 
wicking in 10X SSC overnight. The membrane was baked 1.5 hr at 80*C under 

10 vacuum. Prehybridization solution containing 50% formamide in 3-(N- 
morpholino)propane sulfonic acid (MOPS) buffer was used to block the membrane 
for 3 hr at 42 *C. Fragments of clone 19A2 were labeled with the Boehringer 
Mannheim Random Prime kit according to the manufacturer's instructions 
including .both «P 32 dCIP and aP 32 dTTP. Usiincorporated label was removed on 

15 a Sephadex G25 column in TE buffer. The membrane was probed with 1.5 x 10 6 
counts per ml of prehybridization buffer. The blot was then washed successively 
with 2X SSC/0.1% SDS at room temperature, 2X SSC/0.1% SDS at 42 'C, 2X 
SSC/0.1% SDS at 50-C, IX SSC/0.1% SDS at 5<TC, 0.5X SSC/0.1% SDS at 
50 # C and 0. IX SSC/0. 1 % SDS at SOX. The blot was then exposed to film for 

20 19 hr. 

Hybridization using a BstXl fragment from clone 19A2 
(corresponding to nucleotides 2011 to 3388 in SEQ ID NO: 1) revealed a weak 
signal in the approximately 5 kb range in liver, placenta, thymus, and tonsil total 
RNA. No signal was detected in kidney, brain or heart samples. The amount of 
25 RNA present in the kidney lane was minimal, as determined with ethidium 
. bromide staining. 

When using a second fragment of clone 19A2 (encompassing the 
region from bases 500 to 2100 in SEQ ID NO: 1), RNA transcripts of two 
- , different sizes were detected in a human multi-tissue Northern (MTN) blot using 
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polyA + RNA (Clontech). An approximately 6.5 kb band was observed in spleen 
and skeletal muscle, while a 4.5 kb band was detected in lung and peripheral 
blood leukocytes. The variation in sizes observed could be caused by tissue 
specific polyadenylation, cross reactivity of the probe with other integrin family 
5 members, or hybridization with alternatively spliced mRNAs. 

Northern analysis using a third fragment from clone 19A2, 
spanning nucleotides 2000 to 3100 in SEQ ID NO: 1, gave results consistent with 
those using the other clone 19A2 fragments. 

RNA from three myeloid lineage cell lines was also probed using 

10 the fragments corresponding to nucleotides 500 to 2100 and 2000 to 3100 in SEQ 
ID NO: 1: A THP-1 cell line, previously stimulated with PMA, gave a diffuse 
signal in the same size range (approximately 5.0 kb), with a slightly stronger 
intensity than the tissue signals. RNA from unstimulated and DMSO-stimulated 
HL-60 cells hybridized with the a d probe at the same intensity as the tissue 

15 samples, however, PMA treatment seemed to increase the signal intensity. Since 
PMA and DMSO drive HL-60* cell differentiation toward monocyte/macrophage 
and granulocyte pathways, respectively, this result suggests enhanced a d 
expression in monocyte/macrophage cell types. U937 cells expressed the a d 
message and this signal did not increase with PMA stimulation. No band was 

20 detected in Molt, Daudi, H9, JY, or Jurkat cells. 

■ Example 7 

Transient Expression of Human ot d Constructs 

r A. Generation of expression constructs 

The human clone 19A2 lacks an initiating methionine codon and 
25 possibly some of the 5 ' signal sequence. Therefore, in order to generate a human 
expression plasmid containing 19A2 sequences, two different strategies were used. 
In the first, two plasmids were constructed in which signal peptide sequences 
derived from genes encoding either CD lib or CDllc were spliced into clone 
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19A2 to generate a chimeric a d sequence. In the second approach, a third 
piasmid was constructed in which an adenosine base was added at position 0 in 
clone 19A2 to encode an initiating methionine. 

The three plasmids contained different regions which encoded the 
5 5 ' portion of the a d sequence or the chimeric a d sequence. The a d region was 
PCR amplified (see conditions in Example 2) with a specific 3 ' primer BamRev 
(set out below in SEQ ID NO: 26) and one of three 5 ' primers. The three 5 ' 
primers contained in sequence: (1) identical nonspecific bases at positions 1-6 
allowing for digestion, an EcoRLsite from positions 7-12 and a consensus Kozak 

10 sequence from positions 13-18: (2) a portion of the CDllb (primer ER1B) or 
CDllc (primer ERIC) signal sequence, or an adenosine (primer ER1D); and (3) 
an additional 15-17 bases specifically overlapping 5 ' sequences from clone 19 A2 
to allow primer annealing. Primers ER1B. ERIC or ER1D are set out in SEQ 
ID NOS: 27, 28 or 29, respectively, where the initiating methionine codon is 

IS : underlined and the EcoHI site is double underlined. 

5 ' -CC ACTGTC AGG ATGCCCGTG-3 ' (SEQ ID NO: 26) 

S^-AGTTAC GAATTC GCCACCATGGCTCTACGGGTGCTIISIPQIX^BJ: 27) 

20 . 5 '-AGTTACGAATTCGCCACCATGACTCGGACTC 28) 

5 ^AGTTAC GAATTC GCCACCATGACCTTCGGCACTGTaaEO ID NO: 29) 

, The resulting PCR product was digested with EccRl and BarriHI. 

All three plasmids contained a common second a d region (to be 
25 inserted immediately downstream from the 5 * region described in the previous 
paragraph) including the : 3 ' end of the a d clone. The second a d region, which 
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extended from nucleotide 625 into the Xbai site in the vector 3 ' polylinker region 
of clone 19A2, was isolated by digestion of clone 19A2 with BamHL and XbaL 
Three ligation reactions were prepared in which the 3' a d 
BamHl/Xbal fragment was ligated to one of the three 5 ' a d EcoKUBamKL 
5 fragments using Boehringer Mannheim ligase buffer and T4 ligase (1 unit per 
reaction). After a 4 hour incubation at 14°C, an appropriate amount of vector 
pcDNA.3 (Invitrogen) digested with EcoKL and Xbal was added to each reaction 
with an additional unit of ligase. Reactions were allowed to continue for another 
14 hours. One tenth of the reaction mixture was then transformed into competent 

10 XL-1 Blue cells. The resulting colonies were cultured and the DNA isolated as 
in Example 5. Digestion with EddSl identified three clones which were positive 
for that restriction site, arid thus, the engineered signal sequences. The clones 
were designated pATM.Bl (CDllb/a d , from primer ER1B), pATM.CIO 
(CDllc/a d , from primer ERIC) and pATM.D12 (adenosine/a d from primer 

15 ERld). The presence of the appropriate signal sequences in each clone was 
verified by nucleic acid sequencing. 

B. Transfection of COS Cells 

Expression from the a d plasmids discussed above was effected by 
cotransfection of COS cells with the individual plasmids and a CD 18 expression 

20 plasmid, pRC.CD18. As a positive control, COS cells were also co-transfected 
with the plasmid pRC.CD18 and a CDlla expression plasmid, pDC.CDHA. 

Cells were passaged in culture medium (DMEM/10%FBS/pen- 
strep) into 10 cm Corning tissue culture-treated petri dishes at 50% confluency 16 
hours prior to transfection. Cells were removed from the plates with Versene 

25 buffer (0.5 mM NaEDTA in PBS) without trypsin for all procedures. Before 
transfection, the plates were washed once with serum-free DMEM. Fifteen 
micrograms Of each plasmid were added to 5 ml transfection buffer (DMEM with 
20 jig/ml DEAE-Dextran and 0.5 mM chloroquine) on each plate. After 1.5 
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hours incubation at 37" C, the cells were shocked for 1 minute with 5 ml 
DMEM/10% DMSO. This DMSO solution was then replaced with 10 ml/plate 
culture medium. 

Resulting transfectants were analyzed by ELISA, FACS, and 
5 immunoprecipitation as described in Examples 8, 9, and 10. 

Example 8 

ELISA Analysi s of COS Transfectants . 

In order to determine if the COS cells co-transfected with CD 18 
expression plasmid pRC. CD 18 and an a d plasmid expressed a d on the cell surface 

10 in association with CD18, ELISAs v/exe performed using primary antibodies 
raised against CD18 {e.g., TS1/18 purified from ATCC HB203). As a positive 
control, EIJSAs were also performed on cells co-transfected with the CD18 
expression plasmid and a CD1 la expression plasmid, pDC.CDll A. The primary 
antibodies in this control included CD 18 antibodies and anti-CD 11a antibodies 

15 (e.g., TS1/22 purified from ATCC RB202). 

For ELISA, cells from each plate were removed with Versene 
buffer and transferred to a single 96-well flat-bottomed Corning tissue culture 
plate. Cells were die wed to incubate in culture media 2 days prior to assay. The 
plates were then washed twice with 150 /d/well D-PBS/0.5% teleost skin gelatin 

20 (Sigma) solution. t This buffer was used in all steps except during the 
development. All washes and incubations were performed at room temperature. 
The wells were blocked with gelatin solution for 1 hour- Primary antibodies were 
diluted to 10 /ig/ml in gelatin solution and. 50 >xl were then added to each well. 
Triplicate wells were set up for each primary antibody. After 1 hour incubation, 

25 . . plates were washed 3X with 150 jd/well gelatin solution. Secondary antibody 
(goat anti-mouse Ig/HRP-Fc specific [Jackson, West Grove, PA]) at a 1:3500 
dilution was added at 50 ptl/well and plates were incubated for 1 hour. After 
three washes, plates were developed for 20 minutes with 100 /il/well o- 
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phenyldiamine (OPD) (Sigma) solution (1 mg/ml OPD in citrate buffer) before 
addition of 50 /tl/well 15% sulfuric acid. 

Analysis of transfectants in the ELISA format with anti-CD 18 
specific antibodies revealed no significant expression above background in cells 
5 transfected only with the plasmid encoding CD 18. Cells co-transfected with 
plasmid containing CDlla and CD18 showed an increase in expression over 
background when analyzed with CD18 specific antibodies or with reagents 
specific for CD 11 a. Further analysis of cells co-transfected with plasmids 
encoding CD18 and one of the a d expression constructs (pATM.ClO or 
10 pATM.D12) revealed that cell surface expression of CD18 was rescued by 
concomitant expression of a d . The increase in detectable CD 18 expression in 
COS cells transfected with pATM.CIO or pATM.D12 was comparable to that 
observed in co-transfected CDlla/CD18 positive control cells. 

Example 9 

15 FACS Analysis of COS Transfectants 

For FACS analysis, cells in petri dishes were fed with fresh culture 
medium the day after transfecrion and allowed to incubate 2 days prior to the 
assay. Transfectant cells were removed from the plates with 3 ml Versene, 
washed once with 5 ml FACS buffer (DMEM/2% FBS/0.2% sodium azide) and 

20 diluted to 500,000 cells/sample in 0. 1 ml FACS buffer. Ten microliters of either 
1 mg/ml FTTC-conjugated CD18, CDlla, or GDI lb specific antibodies (Becton 
Dickinson) or 800 >g/ml CFSE-conjugated murine 23F2G (anti-CD18) (ATCC 
HB11081) were added to each sample. Samples were then incubated on ice for 
45 minutes, washed 3X with 5 ml/ wash FACS buffer and resuspended in 0.2 ml 

25 FACS buffer. Samples were processed on a Becton Dickinson FACscan and the 
data analyzed using Lysys II software (Becton Dickinson). 

COS cells transfectesd with CD 18 sequences only did not stain for 
CD18, CDlla or CDllb. When co-transfected with CD1 la/CD 18, about 15% 
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of the cells stained with antibodies to CD 11a or CD18. All cells transfected with 
CD18 and any a d construct resulted in no detectable staining for CDlla and 
CDllb. The pATM.Bl, pATM.CIO and pATM.D12 groups stained 4%, 13% 
and 8% positive for CD18, respectively. Fluorescence of the positive population 
5 in the CDlla/CD18 group was 4-fold higher than background. In comparison, 
the co-transfection of a d constructs with the CD18 construct produced a positive 
population that showed a 4- to 7-fold increase in fluorescence intensity over 
background. 

Example 10 

10 BiotinrLabeled Immunoprecipitation of 

Human a d /CD18 Complexes from Co-transfected COS Cells 

Immunoprecipitatipn was attempted on cells co-transfected with 
CD 18 and each of the a d expression plasmids separately described in Example 7 
in order to determine if a d could be isolated as part of the a/3 heterodimer 

15 complex characteristic of integrins. 

Transfected cells (1-3 x 10 8 cells/group) were removed from petri 
dishes with Versene buffer and washed 3 times in 50 ml/group D-PBS. Each 
sample was labeled with 2 mg Sulpho-NHS Biotin (Pierce, Rockford, IL) for 15 
minutes at room temperature. The reaction was quenched by washing 3 times in 

20 50 ml/sample cold D-PBS. Washed cells were resuspended in 1 ml lysis buffer 
(1 % NP40, 50 mM Tris-HCl, pH 8.0, 0.2 M NaCl, 2mMCa + + ,2mMMg + + , 
and protease inhibitors) and incubated 15 minutes on ice. Insoluble material was 
pelleted by centrifugation at 10,000 g for 5 minutes, and the supernatant removed 
lo fresh tubes. In order to remove material non-specifically reactive with mouse 

25 immunoglobulin, a pre-clearance step was initially performed. Twenty-five 
micrograms of mouse immunoglobulin (Cappel, West Chester, PA) was incubated 
: with supernatants at 4*C. After 2.5 hr, 100 pi (25 fig) rabbit anti-mouse Ig 
conjugated Sepharose (prepared from Protein A Sepharose 4B and rabbit anti- 
mouse IgG, both from Zymed, San Francisco, CA) was added to each sample; 
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incubation was continued at 4 # C with rocking for 16 hours. Sepharose beads 
were removed from the supematants by centrifiigation. After pre-clearance, the 
supernatants were then treated with 20 /ig anti-CD 18 antibody (TS1.18) for 2 
hours at 4*C. Antibody /antigen complexes were isolated from supernatants by 
5 incubation with 100 ^I/sample rabbit anti-mouse/Protein A-sepharose preparation 
described above. Beads were washed 4 times with 10 mM HEPES, 0.2 M NaCl, 
and 1% Triton-X 100. Washed beads were pelleted and boiled for 10 minutes in 
20 fil 2X Laemmli sample buffer with 2% jS-mercaptoethanol. Samples were 
centrifiiged and run on an 8% prepoured Novex polyacrylamide gel (Novex) at 

10 100 V for 30 minutes. Protein was transferred to nitrocellulose membranes 
(Schleicher & Schuell) in TBSrT buffer at 200 mAmps for 1 hour. Membranes 
were blocked for 2 hr with 3% BSA in TBS-T. Membranes were treated with 
1:6000 dilution of Strep-avidin horse radish peroxidase (POD) (Boehringer 
Mannheim) for 1 hour, followed by 3 washes in TBS-T. The Amersham 

15 Enhanced Chemiluminescence kit was then used according to the manufacturer's 
instructions to develop the blot. The membrane was exposed to Hyperfilm MP 
(Amersham) for 0.5 to 2 minutes. 

Immunbprecipitation of CD 18 complexes from cells transfected 
with pRC CDl8 and either pATM.Bl, pATM.CIO or pATM.D12 revealed 

20 surface expression of a heterodimeric species consisting of approximately 100 kD 
0 chiin, consistent with the predicted size of CD18, and an a chain of 
approximately 150 kD, corresponding to a d . 

Example 11 

Stable Transfection of Human a d in Chinese Hamster Ovary Cells 
25 To determine whether a d is expressed on the cell surface as a 

heterodimer in association with CD i 8, cDN As encoding each chain were both 
transiently and stably transfected into a cell line iacking both <x d and CD18. 
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For these experiments, <* d cDNA was augmented with additional 
leader sequences and a Kozak consensus sequence, as described in Example 7, 
and subcloned into expression vector pcDNA3. The final construct, designated 
pATM.D12, was co-transfected with a modified commercial vector, pDCl.CD18 
5 encoding humaji CD 18 into dihydrofolate reductase (DHFR)" Chinese hamster 
ovary (CHO) cells. The plasmid pDCl.CDIS encodes a DHFR + marker and 
transfectants can be selected using an appropriate nucleoside-deficient medium. 
The modifications which resulted in pDCl.CD18 are as follows. 

The plasmid pRC/CMV (Invitrogen) is a mammalian expression 
10 vector with a cytomegalovirus promoter and, ampicillin resistance marker gene. 

A DHFR gene from the plasmid pSCl 190-DHFR was inserted into pRC/CMV 5 ' 
of the SV40 origin of replication. In addition, a polylinker from the 5 * region 
of the plasmid pHF2G-DHF was ligated into the pRC/CMV/DHFR construct, 3 ' 
to the DHFR gene. CD 18 encoding sequences are subsequently cloned into the 
15 resulting plasmid between the 5 ' flanking polylinker region and the bovine growth 
hormone poly A encoding region. 

Surface expression of CD 18 was analyzed by flow cytometry using 
the monoclonal antibody TS 1/18. Heterodimer formation detected between a d 
and CD 18 in this cell line was consistent with the immunoprecipitation described 
.20 in Example 10 with transient expression in COS cells. 

Example 12 : 

. . Human a d binds to ICAM-R in a CD18-dependent fashion 

In view, of reports that demonstrate interactions between the 
leukocyte integrins,and intercellular adhesion molecules (ICAMs) which mediate 
25 cell-cell qontact [Hynes, Cell 59:11-25 (1992)], the ability of CHO cells 
expressing a d /CD18 to bind ICAM-1, ICAM-R, or VCAM-1 was assessed by two 
methods. 
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In replicate assays, soluble ICAM-1, ICAM-R, or VCAM-1 IgGl 
fusion proteins were immobilized on plastic and the ability of a d /CD18 CHO 
transfected cells to bind the immobilized ligand was determined. Transfected cells 
were labeled internally with calcein, washed in binding buffer (RPMI with 1% 
5 BSA), and incubated in either buffer only (with or without 10 ng/ml PMA) or 
buffer with anti-CD 18 monoclonal antibodies at 10 /ig/ml. Transfected cells were 
added to 96- well Immulon 4 microtiter plates previously coated with soluble 
ICAM-1/IgGl, ICAM-R/IgGl or VCAM-1/IgGl fusion protein, or bovine serum 
albumin : (BSA) as a negative control. Design of the soluble forms of these 

10 adhesion molecules is described and fully disclosed in co-pending and co-owned 
U.S. Patent Application Serial No. 08/102,852, filed August 5, 1993. Wells were 
blocked with 1 % BSA ih PBS prior to addition of labeled cells. After washing 
the plates by immersion in PBS with 0. 1 % BSA for 20 minutes, total fluorescence 
remaining in each well was measured using a Cyiofluor 2300 (Millipore, Milford, 

15 MA). - - 

In experiments with immobilized ICAMs, a d /CD18 co-transfectants 
consistently showed a 3-5 fold increase in binding to ICAM-R/IgGl wells over 
BSA coated wells: The specificity and CD18-dependence of this binding" was 
demonstrated by the inhibitory effects of anti-CD18 antibody TS 1/18. The binding 

20 of cells transfected with CD 1 la/CD18 to lCAM-1/IgGl wells was comparable to 
the binding observed with BSA coated wells. CDlla/CD18 transfected cells 
showed a 2-3 fold increase in binding to ICAM-1/IgGl wells only following 
pretreatment with PMA. PMA treatment of a d /CD18 transfectants did not affect 
binding to ICAM-1/IgGl or ICAM-R/IgGl wells. No detectable binding of 

25 a d /CD18 transfeetants to VCAM-1/IgGl wells was observed. 

Binding of a d /CD18-transfected cells to soluble ICAM-1/IgGl, 
ICAM-R/IgGl, or VCAM 1/IgGl fusion proteins was determined by flow 
cytometry. Approximately one million a d /CD18-transfected CHO cells (grown in 
spinner flasks for higher expression) per measurement were suspended in 100 y\ 
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binding buffer (RPMI and 1% BSA) with or without 10 /xg/ml anti-CD18 
antibody. After a 20 minute incubation at room temperature, the cells were 
washed in binding buffer and soluble ICAM-1/IgGl or ICAM-R/IgGl fusion 
protein was added to a final concentration of 5 ug/ml. Binding was allowed to 
5 proceed for 30 minute at 37°C, after which the cells were washed three times and 
resuspended in 100 ii\ binding buffer containing FITC-conjugated sheep anti- 
human IgGl at a 1:100 dilution. After a 30 minute incubation, samples were 
washed three times and suspended in 200 /xl binding buffer for analysis with a 
Becton Dickinson FACScan. 

10 Approximately 40-50% of the a d /CD18 transfectants indicated 

binding to ICAM-R/IgGl, but no binding to ICAM-1/IgGl or VCAM-1/IgGl 
proteins. Pretreatment of transfected cells with PMA has no effect on a d /CD18 
binding to either ICAM-1/IgGl, ICAM-R/IgGl or VCAM-1/IgGl, which was 
consistent with the immobilized adhesion assay. Binding by ICAM-R was 

15 reduced to background , levels after treatment of c* d /CD18 transfectants with anti- 
CD^ antibody TS1/18. 

The collective data from these two binding assays illustrate that 
a d /CD18 binds to ICAM-R and does so preferentially as compared to ICAM-1 
and VCAM-1. The a d /CD13 binding preference for ICAM-R over ICAM-1 is 

20 opposite that observed with CDlla/CD18 and CDlIb/CD18. Thus modulation 
of or d /CD18 binding may be expected to selectively affect normal and pathologic 
immune function where ICAM-R plays a prominent role. Moreover, results of 
similar assays, in which antibodies immuncspecific for various extracellular 
domains of ICAM-R were tested for their ability to inhibit binding of ICAM-R 
■ 25 to a d /CD 1 8 transfectants, indicated that a d /CD 1 8 and CD 1 la/CDl 8 interact with 
different domains of ICAM-R. 

The failure of CDlla/CD18 to bind ICAM-1/IgGl or ICAM- 
R/IgGl in solution suggests that the affinity of binding between CD 1 la/CD 1 8 and 
. ICAM-1 or ICAM-R is too low to permit binding in solution. Detection of 
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a d /CD18 binding to ICAM-R/IgGl, however, suggests an unusually high binding 
affinity. 

a d Binding to iC3b 

. Complement component C3 can be proteolytically cleaved to form 
5 the complex iC3b, which initiates the alternative pathway of complement 
activation and leads ultimately to cell-mediated destruction of a target. Both 
CDllb and CDllc have been implicated in iC3b binding and subsequent 
phagocytosis of iC3b-coated particles. A peptide fragment in the CD 1 lb I domain 
has recently been identified as the site of iC3b interaction [Ueda, et al t 
10 Proc.NatlAcad.ScL (USA) 57:10680-10684(1994)]. The region of iC3b binding 
is highly conserved in. CDllb; CDllc, and a d , suggesting an a d /iC3b binding 
interaction. . .. 

Binding of a d to iC3b is performed using transfectants or cell lines 
naturally expressing a d (for example, PMA-stimulated HL60 cells) and iC3b- 
15 coated sheep red blood cells (sRBC) in a rosette assay [Dana, et al , 7. Clin. Invest. 

75:153-159 (1984)]. The. abilities of a d /CD18 CHO transfectants, VLA4-CHO 
transfectants (negative control) and PMA-stimulated HL60 cells (positive control) 
to form rosettes are compared in the presence and absence of an anti-CD18 
monoclonal antibody (for example TS 1/18.1). 

20 Example 13 

Screening bv Scintillation Proximity Assay . 

Specific inhibitors of binding between She a d ligands of the present 
invention and their binding- partners (a d ligand/anti-ligand pair) may be 
determined by a variety of means, such as scintillation proximity assay techniques 
25 as generally described in U.S. -Patent No. 4,271,139, Hart and Greenwald, 
Mollmmunpl 72:265-267 (1979), and Hart and Greenwald, J.Nuc.Med. 
20:1062-1065 (1979), each of which is incorporated herein by reference. 
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Briefly, one member of the oc d ligand/anti-ligand pair is bound to 
a solid support. A fluorescent agent is also bound to the support. Alternatively, 
the fluorescent agent may be integrated into the solid support as described in U.S. 
Patent No. 4,568,649, incorporated herein by reference. The non-support bound 
5 member of the a d ligand/anti-ligand pair is labeled with a radioactive compound 
that emits radiation capable of exciting the fluorescent agent. When the ligand 
binds the radiolabeled anti-ligand, the label is brought sufficiently close to the 
support-bound fluorescer to excite the fluorescer and cause emission of light. 
When not bound, the label is generally top distant from the solid support to excite 

10 the fluorescent agent, and light emissions are low. The emitted light is measured 
and correlated with binding between the ligand and the anti-ligand. Addition of 
a binding inhibitor to the sample will decrease the fluorescent emission by keeping 
the radioactive label from being , captured in the proximity of the solid support. 
Therefore, binding inhibitors may be identified by their, effect on fluorescent 

15 emissions from the samples. Potential anti-ligands to a d may also be identified 
by similar means. 

E xample 14 
Soluble Human a d Expression Construct s . 

The expression of full-length, .soluble human a d /CD18 
20 heterodimeric protein provides easily purified material for immunization and 
... binding assays. The advantage of generating soluble protein is that it can be 
. purified from supernatants rather than from cell lysates (as with full-length 
membrane-bound a d /CDl&)\ recovery in r therefore improved and impurities 
reduced. 

25 The soluble a d expression plasmid was constructed as follows. A 

nucleotide fragment corresponding to the region from bases 0 to 3161 in SEQ ID 
NO: 1, cloned into plasmid pATM.D12, was isolated by digestion with HindRl 
and AatU. A PGR fragment corresponding to bases 3130 to 3390 in SEQ ID NO: 
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1, overlapping the HindBllAatll fragment and containing an addition Mlul 
restriction site at the 3 ' terminus, was amplified from pATM.D12 with primers 
sHAD.5 and sHAD.3 set out in SEQ ID NOS: 30 and 31, respectively. 

5 '-TTGCTGACTGCCTGCAGTTC-3 ' (SEQ ID NO: 30) 

5 5 ' -GTTCTGACGCGTAATGGC ATTGTAGACGTCGTCTTC(SEQ ID NO: 31) 

The PCR amplification product was digested with AatU and Mlul and ligated to 
the HindnUAatU fragment. The resulting product was ligated into HindHllMluL- 
digested plasmid pDCLs. 

This construct is co-expressed with soluble CD 18 in stably 
10 transfected CHO celiV and expression is detected by autoradiographic 
visualization of immuhoprecipitated CD 18 complexes derived from 35 S-methionine 
labeled cells. The construct is also co-expressed with CD18 in 293 cells 
[Beimm: et al, J. CelLBiochem. 52:183-195 (1993)]. 

Soluble Human a d I Domain Expression Constructs 

15 It has previously been reported that the I domain in CD1 la can be 

expressed as an independent structural unit that maintains ligand binding 
capabilities and antibody recognition [Randi and Hogg, J.Biol Chem. 269: 12395- 
12398 (1994); Zhout, et al f J.BioLChem. 269: 17075-17079 (1994); Michishita, 
et al, Cell 72:857-867 (1993)1. To generate a soluble fusion protein comprising 

20 the a d I domain and human IgG4, the a d I domain is amplified by PCR using 
primers designed to add flanking BamHl and Xhol restriction sites to facilitate 
subcloning. These primers are set out in SEQ ID NOS: 32 and 33 with restriction 
sites underlined. 

5 '-ACGTATGCAGGAIQQCATC.\AGAGATGGACATCGCISEQ ID NO: 32) 
25 5 '-ACTGCATGT£I£GAGGCHX3AAGCOT ID NO: 33) 
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The C nucleotide immediately 3' to the BamHl site in SEQ ID NO: 32 
corresponds to nucleotide 435 in SEQ ID NO: 1; the G nucleotide 3 ' to the Xhol 
site in SEQ ID NO: 33 is complementary to nucleotide 1067 in SEQ ID NO: 1. 
The amplified I domain is digested with the appropriate enzymes, the purified 
5 fragment ligated into the mammalian expression vector pDCs and the prokaryotic 
expression vector pGEX-4T-3 (Pharmacia) and the I domain fragment sequenced. 
The fusion protein is then expressed in COS, CHO or E.coli cells transfected or 
transformed with an appropriate expression construct. 

Given the affinity of a d for ICAM-R, expression of the a d I 
10 domzun may be of sufficient affinity to be a useful inhibitor of cell adhesion in 
which a j participates. 

Analysis of Human ot d I Domain/igG4 Fusion Proteins 

Protein was resolved by SDS-PAGE under reducing and non- 
reducing conditions and visualized by either silver staining or Coomassie staining. 
15 Protein was then transferred to Immobilon PVDF membranes and subjected to 
Western blot analysis using anti-human IgG monoclonal antibodies or anti-bovine 
Ig monoclonal antibodies. 

Protein detected was determined to migrate at about 120 kD under 
non-reducing conditions and at about 45. kD under reducing conditions. Minor 
20 t . bands were also detected on non-reducing gels at approximately 40-50 kD which 
:;v were .reactive with the anti-human, but not anti-bovine, antibodies. A 200 kD 
minor band was determined to be bovine Ig by Western blot. 

Binding Assays Using I Domain E xpression Products , 

The ability of the I domain to specifically recognize ICAM-R/IgG 
25 chimeric protein was tested in an ELISA format - Serial dilutions of a d 

I domain IgG4 fusion protein (Ia d /IgG4) in TBS were incubated with ICAM- 
1/IgG, ICAM-R/IgG, VCAM-l/IgG, or an irrelevant IgGl myeloma protein 
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immobilized on Immulon IV RIA/EIA plates. CD 11a I domain/IgG chimeric 
protein and human IgG4/kappa myeloma protein were used as negative controls. 
Bound IgG4 was detected with the biotinylated anti-IgG4 monoclonal antibody 
HP6023 followed by addition of strepavidin-peroxidase conjugate and development 
with substrate o-phenyldiamine. 

In repeated assays, no binding of the CDlla/IgG4 protein or the 
IgG4 myeloma protein was detected with any of the immobilized proteins. The 
Ia d /IgG4 protein did not bind to fish skin gelatin or bovine serum albumin 
blocking agents, human IgGl, or ICAM-l/IgG. A two to three fold increase in 
binding signal over background was detected in ICAM-R/IgG protein coated wells 
using 1-5 jig/ml concentrations of Ia d /IgG4 protein. The signal in VCAM-l/IgG 
protein coated wells was 7-10 fold higher than background. In previous assays, 
a d /CD18 transfected.;eKO cells did not bind VCAM-l/IgG protein, suggesting 
that VCAMkl binding may be characteristic of isolated I domain amino acid 
sequences. 



Additional <* d t domain constructs 

Additional a d I domain constructs are generated in the same fashion 
as the previous construct, but incorporating more amino acids around the a d I 
domain. Spedfic Constructs include: i) sequences from exon 5 (amino acids 127- 
353 in SEQ ID NO: 2), preceding the current construct, ii) the EF-hand repeats 
(amino acids 17-603 in SEQ ID NO: 2) following the I domain, and iii) the alpha 
chain truncated at the transmembrane region (amino acids 17-1029 in SEQ ID 
NO: 2), with an IgG4 tail for purification and detection purposes. These 
constructs are ligated into either the mammalian expression vector pDCSl or the 
prokaryotic expression vector pGEX-4T-3 (Pharmacia) and the I domain 
sequenced. The fusidn proteins are then be expressed in COS, CHO, or E.coli 
cells transformed or transfected with an appropriate expression construct. Protein 
are purified on a ProSepA column (Bioprbcessing Limited, Durham, England), 
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tested for reactivity with the anti-IgG4 monoclonal antibody HP6023 and 
visualized on polyacrylamide gels with Coomassie staining. 

In order to construct an expression plasmid for the entire a d 
polypeptide, pATM.D12, described supra, is modified to express an a d -IgG4 
5 fusion protein by the following method. IgG4 encoding DNA is isolated from the 
vector pDCSl by PCR using primers which individually incorporate a 5 ' AatU 
restriction site (SEQ ID NO: 89) and a 3 ' Xbal restriction site (SEQ ID NO: 90). 

5 '-CGCTGTGACGTCAGAGTTGAGTCCAAATATGG-3 ' (SEQ ID NO: 89) 
5 '-GGTGACACTATAGAATAGGGC-3 ' (SEQ ID NO: 90) 

10 Plasmid pATM.D12 is digested with Aatll and Xbal, and the appropriately 
digested and purified IgG4 PCR product ligated into the linear vector. 
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ExampSe 15 

Production of Human a d -Specific Monoclonal Antibodies 

Transiently transfected cells from Example 7 were washed three 
times in Dulbecco's phosphate buffered saline (D-PBS) and injected at 5 x 10 6 
5 cells/mouse into Balb/c mice with 50 /xg/mouse muramyl dipeptidase (Sigma) in 
PBS. Mice were injected two more times in the same fashion at two week 
intervals. The pre-bleed and immunized serum from the mice were screened by 
FACS analysis as outlined in Example 9 and the spleen from the mouse with the 
highest reactivity to cells transfected with a d /CD18 was fused. Hybridoma 
10 culture supcrnatants were then screened separately for lack of reactivity against 
COS cells transfected with CDlla/CD18 and for reactivity with cells co- 
transfected with an cr d expression plasmid and CB18. 

This method resulted in no monoclonal antibodies. 

As an alternative for production of monoclonal antibodies, soluble 
15 <* d I domain IgG4 fusion protein was affinity purified from supernatant of stably 
transfected CHO cells and used to immunize Balb/c mice as described above. 
Hybridomas were established and supernatants from these hybridomas were 
screened by ELISA for reactivity against a d I domain fusion protein. Positive 
cultures were then analyzed for reactivity with full length a d /CD18 complexes 
20 expressed on CHO transfectants. 

Mouse 1908 received three initial immunizations of c* d /CD18 
transfected CHO cells and two subsequent boosts with soluble a d /CD18 
heterodimer. Two final immunizations included 50 /xg/mouse Ia d /IgG4 fusion 
protein. The fusion produced 270 IgG-producing wells. Supernatant from 45 
25 wells showed at least 7-fold higher binding to Ia d /IgG4 fusion protein than to 
human IgG4 by ELISA. None of the supernatants reacted to a d /CD18 transfected 
CHO cells as determined by FACS analysis. 

To determine whether the supernatants were able to recognize 
integrin alpha subunit proteins in another context, fresh frozen splenic sections 
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were stained with supernatants from 24 of the 45 wells. Three supernatants were 
determined to be positive: one stained large cells in the red pulp, while two others 
stained scattered cells in the red pulp and also trabeculae. 

These supernatants were further analyzed by their ability to 
5 immunoprecipitate biotinylated CD1 8 complexes from either a d /CD 1 8 transfected 
CHO cells or PMA-stimulated HL60 cells. Fusion wells with supernatants that 
recognized protein in detergent lysates (which should not be as conformationally 
constrained as protein expressed as heterodimers) were selected for further 
subcloning. Monoclonal antibodies which recognize protein in detergent may be 
10 more useful in immunoprecipitation of heterodimeric complexes from 
transfectants, tissues, and cell lines. 

As another alternative: monoclonal antibodies are generated as 
follows. Affinity purified o^/CDlS heterodimeric protein from detergent lysates 
of stably transfected CHO cells is used with 50 ptg/ml muramyl dipeptidase to 
15 immunize Balb/c mice as described above. Mice receive three immunizations 
before serum reactivity against a^/CDlS is determined by immunoprecipitation 
of biotinylated complexes in the CHO transfectants. Hybridomas from positive 
animals are established, according to standard protocols, after which hybridoma 
cultures are selected by flow cytometry using a d /CD18 transfectants. 
20 CDlla/CD18 transfectants are titilized to control for CD18-only reactivity. 

As another alternative for monoclonal antibody production, Balb/c 
mice undergo an jmmunization/immunosuppression protocol designed to reduce 
reactivity to CHO cell determinants on transfectants used for immunization. This 
protocol .involves immunization with untransfected CHO cells and subsequent 
25 killing of CHO-reactive B-cell blasts with cyclophosphamide treatment. After 
three rounds of immunization and cyclophosphamide treatment are performed, the 
mice are immunized with or d /CD18 CHO transfected. cells as described above. 

As still another alternative, heterodimeric CD 18 complexes are 
immunoprecipitated from detergent lysates of whole spleen using an anti-CD 18 
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monoclonal antibody, following preclearance of CD 1 la/CD 18 and CDllb/CD18. 
CDlla/CD18 and CD Lib/CD 18 complexes are precleared by affinity 
chromatography using monoclonal antibodies TS2/4 and Mol, respectively, 
coupled to a chromatographic resin. The remaining CD18 complexes are used as 
5 an immunogen in Balb/c mice for the first immunization. Three immunizations 
are given at three week intervals, the initial immunization administered in 
conjunction with Freund's Complete Adjuvant and the subsequent immunizations 
with Freund's Incomplete Adjuvant. Serum is assayed for a d -specific reactivity 
by immunoprecipitation. Resulting hybridomas are screened by flow cytometry 

10 with a d /CD18 CHO transfectants. 

As another alternative, CD 18 complexes from detergent lysates of 
PMA stimulated HL60 cells are enriched by preclearance as described above. 
Other j£J2 mtegrins 'are' cleared on the same columns. Immunization with the 
resulting complexes, hybridoma production, and screening protocols are 

15 performed as described supra. 

■ - Example 16 

Analysis of a d distributio n with polyclonal serum 

Tissue distribution of a d /CD18 was determined using polyclonal 
antiserum* Antiserum used to stain tissue was obtained from a mouse immunized 

20 • 3 times with a d transited CHO dells (D6.CHO, a d /CD18) with adjuvant peptide 
and once with purified a d /CD18 heterodimer. A final boost included only 
a d /CD18 heterodimer. Approximately 100 p\ immunized serum was precleared 
by addition of approximately 10 8 LFA-l-transfected CHO cells for 2 hours at 
4°C. The resulting serum 'was assayed for a d reactivity at dilutions of 1/5000, 

25 1/10000, 1/20000 arid 1/40000 on normal human spleen. The polyclonal antibody 
was reactive at a dilution of 1/20000, while a 1/40000 dilution stained very 
weakly. 
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Once serum was determined to have specific a d reactivity, it was 
used to stain various lymphoid and non-lymphoid tissues. Monoclonal antibodies 
recognizing CD18, CDlla, CDllb, and CDllc were used in the same 
experiment as controls. Staining of normal spleen sections with a d polyclonal 
sera, and monoclonal antibodies to CDlla, CDllb, CDllc, and CD18 revealed 
the following results. The pattern observed with a d polyclonal sera did not 
display the same pattern of labeling as CDlla, CDllb, CDllc, or CD18. There 
is a distinct pattern of labeling with some cells located in the marginal zone of the 
white pulp and a distinct labeling of cells peripheral to the marginal zone. This 
pattern was not observed with the other antibodies. Individual cells scattered 
throughout the red pulp were also labeled which may or may not be the same 
population or subset seen with CD1 la and CD18. 

Labeling with CDllc did display some cells staining in the 
marginal zone, but the antibody did not show the distinct ring pattern around the 
white pulp when compared to a d polyclonal sera, nor did labeling in the red pulp 
give the same pattern of staining as a d polyclonal sera. 

Therefore, the labeling pattern seen with a d polyclonal serum was 
unique compared to that seen using antibodies to the other jS 2 integrins (CDlla, 
CDllb, CDl lc, and CD18), and suggests that the in vivo distribution of a d in 
man is dinstinct from that of other j8 2 integrins. 

Example 17 . 

Isolation of Rat cDNA Clones 

In view of the existence of both canine and human a d subunits, 
attempts were made jo. isolate homologous genes in other species, including rat 
(this example) and mouse (Example 17, infra). 

A partial sequence of a rat cDNA showing homology to the human 
a d gene was obtained from a rat splenic XgtlO library (Clontech). The library 
was plated at 2 x 10 4 pfu/plate onto 150 mm LBM/agar plates. The library was 
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lifted onto Hybond membranes (Amersham), denatured 3 minutes, neutralized 3 
minutes and washed 5 minutes with buffers as described in standard protocols 
[Sambrook, et al. Molecular Cloning: a l aboratory manual T p.2.110]. The 
membranes were placed immediately into a Stratalinker (Stratagene) and the DNA 
5 crosslinked using the autocrosslinking setting . The membranes were 
prehybridized and hybridized in 30% or 50% formamide, for low and high 
stringency conditions, respectively. Membranes were initially screened with a 
32 P-labeled probe generated from the human a d cDNA, corresponding to bases 
500 to 2100 in clone 19 A2 (SEQ ID NO: 1). The probe was labeled using 

10 Boehringer Mannheim's Random Prime Kit according to manufacturer's suggested 
protocol. Filters were washed with 2X SSC at 55°C. 

Two clones, designated 684.3 and 705.1, were identified which 
showed sequence homology to human a d , human CDllb, and human CDllc. 
Both clones aligned to the human a d gene in the 3 ' region of the gene, starting 

15 at base 1871 and extending to base 3012 for clone 684.3, and bases 1551 to 3367 
for clone 705.1. 

In order to isolate a more complete rat sequence which included the 
5 ' region, the same library was rescreened using the same protocol as employed 
for the initial screening, but using a mouse probe generated from clone A1160 
20 (See Example 17, infra). Single, isolated plaques were selected from the second 
screening and maintained as single clones on LBM/agar plates. Sequencing 
primers 434FL and 434FR (SEQ ID NOS: 34 and 35, respectively) were used in 
a standard PCR protocol to generate DNA for sequencing. 

5 '-TATAGACTGCTGGGTAGTCCCCAC-3 * (SEQ ID NO: 34) 

25 5 '-TGAAGATTGGGGGTAAATAACAGA-3 ' (SEQ ID NO: 35) 

DNA from the PCR was purified using a Quick Spin Column (Qiagen) according 
to manufacturer's suggested protocol. 
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Two clones, designated 741.4 and 741.11, were identified which 
overlapped clones 684.3 and 705.1; in the overlapping regions, clones 741.1 and 
741.11 were 100% homologous to clones 684.3 and 705.1. A composite rat 
cDNA having homology to the human a d gene is set out in SEQ ID NO: 36; the 
5 predicted amino acid sequence is set forth in SEQ ID NO: 37. 

Cloninp of the 5 ' end of Rat a d 

A 5 ' cDNA fragment for the rat a d gene was obtained using a 
Clonetech rat spleen RACE cloning kit according to manufacturer's suggested 
protocol. The gene specific oligonucleotides used were designated 741.11#2Rand 
10 741.2#1R (SEQ ID NOS: 59,and 58, respectively). 

5 '-CCAAAGCTGGCTGCATCCTCTC-3 ' (SEQ ID NO: 59) 

5 '-GGCCTTGCAGCTGGACAATG-3 ' (SEQ ID NO: 58) 

Oligo 741.11#2R encompasses, base pairs 131-152 in SEQ ID NO: 36, in the 
reverse orientation and 74 1.2#1R encompasses bases pairs 696-715 in SEQ ID 
15 NO: 36, also in the reverse orientation. A primary PCR was carried out using 
. the 3/-most oligo, 741.2#1R. A second PCR followed using oligo 741.11#2R 
, . . and DNA. generated, from the primary reaction. A band of approximately 300 
basepairs was detected on a 1% agarose gel. 

The secondary PCR product was, ligated into plasmid pCRTAII 
20 (Invitrogen) according to manufacturer's suggested protocol. White (positive) 
colonies were picked and added to 100 ji\ LBM containing 1 /d of a 50 mg/ml 
carbenicillin stock solution and 1 #1 M13 K07 phage culture in individual wells 
in a round bottom 96 well tissue culture plate. The mixture was incubated at 
... 3.7 °C for .30 minutes to one hour. Following the initial incubation. period, 100 ^1 
25 of LBM (containing 1 /xl of 50 mg/ml carbenicillin and a 1:250 dilution of a 10 
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mg/ml kanamycin stock solution) were added and the incubation was continued 
overnight at 37°C. 

Using a sterile 96 well metal transfer prong, supernatant from the 
96 well plate was transferred to four Amersham Hybond nylon filters. The filters 
5 were denatured, neutralized and cross linked by standard protocols. The filters 
were prehybridized in 20 mis of prehybridization buffer (5X SSPE; 5X 
Denhardts; 1% SDS; 50 ugs/ml denatured salmon sperm DNA) at 50°C for 
several hours while shaking. 

OUgo probes 741.1 I#l and 741.1 1#1R(SEQ ID NOS: 56 and 57, 
10 respectively), encompassing base pairs 86-105 (SEQ ID NO: 36) in the forward 
and reverse orientation respectively, were labeled as follows. 

5 '-CCTGTCATGGGTCTAACCTG-3 * (SEQ ID NO: 56) 

5 '-AGGTTAGACCCATGACAGG-3 ' (SEQ ID NO: 57) 

Approximately 65 ng oligo DNA in 12 /d dH 2 0 was heated to 65°C for two 
15 minutes. Three pi of 10 niCi/ml 7- 32 P-ATP were added to the tube along with 
4 pi 5x Kinase Buffer (Gibco) and 1 pi T4 DNA Kinase (Gibco). The mixture 
was incubated at 37°C for 30 minutes. Following incubation, 16 pi of each 
labeled oligo probe were added to the prehybridization buffer and filters and 
hybridization was continued overnight at 42°C. The filters were washed three 
20 times in 5X SSPE; 0.1% SDS for 5 minutes per wash at room temperature, and 
autoradiographed for 6 hours. Positive clones were expanded and DNA purified 
using the Magic Mini Prep Kit (Promega) according to manufacturer's suggested 
protocol. Clone 2F7 was selected for sequencing and showed 100% homology 
" clone 741.11 in the overlapping region. The complete rat a d nucleic acid 
25 sequence is set out in SEQ ID NO: 54; the amino acid sequence is set out in SEQ 
ID NO: 55. 
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Charac teristics of the R at cDNA and Amino Acid Sequences 

Neither nucleic acid nor amino acid sequences have previously been 
reported for rat a subunits in /3 2 integrins. However sequence comparisons to 
reported human /S 2 integrin a subunits suggests that the isolated rat clone and its 
predicted amino acid sequence are most closely related to a d nucleotide and amino 
acid sequences. 

At the nucleic acid level, the isolated rat cDNA clone shows 80% 
identity in comparison to the human a d cDNA; 68% identity in comparison to 
human CDllb; 70% identity in comparison to human CDllc; and 65% identity 
in comparison to mouse CDllb. No significant identity is found in comparison 
to human CDlla and to mouse CDl : la. 

At the amino acid level, the predicted rax polypeptide encoded by 
the isolated cDNA shows 70% identity in comparison to human a d polypeptide; 
28% identity in comparison to human CDlla; 58% identity in comparison to 
human CDllb; 61% identity in comparison to human CDllc; 28% identity in 
: comparison to mouse CDlla; and 55% identity in comparison to mouse CDllb. 

Example 18 

Monoclonal Antibodies again st Rat oc d I domain/Hu IpG4 Fusion Proteins 

In view of the fact that the, I domain of human integrins has 
. : been demonstrated to participate in ligand binding, it was assumed that the same 
* would be true for rat ot d protein. Monoclonal antibodies immunospecific for the 
rat a d I domain may therefore be useful in rat models of human disease states 
wherein a d binding is implicated. : 

Oligcs M rat alpha-DI5" (SEQ IDiNO: 87) and "rat alpha-DI3" (SEQ 
ID NO: 88) were generated from the rat a d sequence corresponding to base pairs 
. 469-493 and base pairs 1101-1125 (in the reverse orientation), respectively, in 
SEQ ID NO: 54. The cliges were used in a standard PGR reaction to generate 
a rat a d DNA fragment containing the I domain spanning base pairs 459-1125 in 
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SEQ ID NO: 54 e The PGR product was ligated into vector pCRTAII (Invitrogen) 
according to manufacturer's suggested protocol. A positive colony was selected 
and expanded for DNA purification using a Qiagen (Chatswoth, GA) Midi Prep 
kit according to manufacturer's protocol. The DNA was digested with Xhol and 
BglFL in a standard restriction enzyme digest and a 600 base pair band was gel 
purified which was subsequently ligated into pDCSl/HuIgG4 expression vector. 
A positive colony was selected, expanded and DNA purified with a Quiagen Maxi 
Prep Kit. 

COS cells were plated at half confluence on 100mm culture dishes 
and grown overnight at 37°C in 7% C0 2 . Cells were rinsed once with 5 ml 
DMEM. To 5 ml DMEM* 50 pX DEAE-Dextran , 2 >1 chioroquine and 15 fig rat 
a d I domain/HulgG4 DNA described above was added. The mixture was added 
to the COS cells and incubated at 37°G for 3 hours. Media was then removed 
and 5 ml 10% DMSO in CMF-FBS was added for exactly one minute. The cells 
were gently rinsed once with DMEM. Ten ml DMEM containing 10% FBS was 
added to the cells and incubation continued overnight at 37<>C in 7% C0 2 . The 
next day, media was replaced with fresh media and incubation continued for three 
additional days. The media was harvested and fresh media was added to the 
plate. After three days, the media was collected again and the plates discarded. 
The procedure was repeated until 2 liters of culture supernatant were collected. 

Supernatant collected as described above was loaded onto a 
ProseprA column (Bioprocessirig Limited) and protein purified as described 
below. 

The column was initially washed with 15 column volumes of Wash 
Buffer containing 35 mM Tris and 150 mM NaCl, pH 7.5. Supernatant was 
loaded at a slow rate of less than approximately 60 column volumes per hour. 
After loading, the column was washed with 15 column volumes of Wash Buffer, 
15 column volumes of 0.55 M diethanblamine, pH 8;5, and 15 column volumes 
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50 mM citric acid, pH 5.0. Protein was eluted with 50 mM citric acid, pH 3.0. 
Protein was neutralized with 1.0 M Tris, pH 8.0, and dialyzed in sterile PBS. 

The rat a d I domain protein was analyzed as described in Example 
14. The detected protein migrated in the same manner as observed with human 
5 I domain protein. 

Immunization Protocol 

Mice were individually immunized with 50 purified rat a d I 
domain/HuJ.gG4 fusion protein previously emulsified in an equal volume of 
Freunds Complete Adjuvant (FC A) (Sigma). Approximately 200 fil of the 

10 antigen/adjuvant preparation was injected at 4 sites in the back and flanks of each 
of the mice. Two weeks later the mice were boosted with an injection of 100 /il 
rat a d I domain/HuIgG4 antigen (50 ag/mouse) previously emulsified in an equal 
volume of Freunds Incomplete Adjuvant (FIA). After two additional weeks, the 
mice were boosted with 50 fig antigen in 200 ul PBS injected intravenously. 

15 To evaluate serum titers in the immunized mice, retro-orbital bleeds 

were performed on the animals ten days following the third immunization. The 
blood was allowed to clot and serum isolated by centrifugation. The serum was 
; used in an immunoprecipitation on biotinylated (BIP) rat splenocytes. Serum 
from each mouse immunoprecipitated protein bands of expected molecular weight 

20 for rat a d and rat GDI 8. One mouse was selected for the fusion and was boosted 
a fourth time as described above for the third boost. 

The hybridoma supematants were screened by antibody capture, 
described as follows. Immulon 4 plates (Dynatech, Cambridge, Massachusetts) 
were coated at 4°C with 50 /xl/well goat anti-mouse IgA, IgG or IgM (Organon 

25 Teknika) diluted 1:5000 in 50 mM carbonate buffer, pH 9.6. Plates were washed 
3X with PBS containing 0.05% Tween 20 (PBST) and 50 /xl culture supernatant 
was added. After incubation at 37°C for 30 minutes, and washing as described 
above, 50 /il horseradish peroxidase-cenjugated goat anti-mouse IgG9(Fc) 
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(Jackson ImmunoResearch, West Grove, Pennsylvania) diluted 1:3500 in PBST 
was added. Plates were incubated as described above and washed 4X with PBST. 
Immediately thereafter, 100/il substrate, containing 1 mg/ml o-phenylene diamine 
(Sigma) and 0.1 /xl/ml 30% H 2 0 2 in 100 mM citrate, pH4.5, was added. The 
5 color reaction was stopped after 5 minutes with the addition of 50 jtl 15 % H 2 S0 4 . 
Absorbance at 490 nm was read on a Dynatech plate reader. 

Supernatant from antibody-containing wells was also analyzed by 
ELISA with immobilized rat <x d I domain/HulgG4 fusion protein. An ELISA with 
HuIgG4 antibody coated plates served as a control for reactivity against the IgG 
10 fusion partner. Positive wells were selected for further screening by BIP on rat 
splertocyte lysates using techniques described below. 

Biotinylation of Cell Surface Antigens 

Rats were sacrificed by asphyxiation with CO2 and spleens were 

removed using standard surgical techniques. Splenocytes were harvested by 
15 gently pushing the spleen through a wire mesh with a 3 cc syringe plunger in 20 

mis RPMI. Cells were collected into a 50 ml conical tube and washed in the 

appropriate buffer. 

Cells were washed three times in cold D-PBS and resuspended at 

a density of 10 8 to ICF cells in 40 ml PBS. Four mg of NHS-Biotin (Pierce) was 
20 added to the cell suspension and the reaction was allowed to continue for exactly 

15 minutes at room temperature. The cells were pelleted and washed three times 

in cold D-PBS. 

Cell Lysates 

Ceils were resuspended at a density of 10 8 cells/ml in cold lysis 
25 Buffer (1 % NP40; 50 mM Tris-HCl, pH 8.0; 150 mM NaCl; 2 mM CaCl; 2 mM 
MgCl; 1:100 solution of pepstain/leupeptine, and aprotinin, added just before 
adding to cells; and 0.0001 g PMSF crystals, added just before adding to cells). 
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Ly sates were vortexed for approximately 30 seconds, incubated for 5 minute at 
room temperature, and further incubated for 15 minutes on ice. Lysates were 
centrifuged for 10 minutes at 10,000 xg to pellet the insoluble material. 
Supernatant was collected into a new tube and stored at between 4°C and -20°C. 

5 Immunoprecipitation 

One ml cell lysate was precleared by incubation with 200 pi of a 
protein A sepharose slurry (Zymed) overnight at 4°C Precleared lysate was 
aliquoted into Eppendorf tubes at 50 jd/tube for each antibody to be tested. 
Twenty-five /xl of polyclonal serum or 100 to 500 ul of monoclonal antibody 

10 supernatant were added to the precleared lysates and the resulting mixture 
incubated for 2 hours at 4°C with rotation. One hundred (il rabbit anti-mouse 
IgG (Jackson) bound to protein A sepharose beads in a PBS slurry was then added 
and incubation continued for 30 minutes at room temperature with rotation. 
Beads were pelleted with gentle centrifugation, and washed three times with cold 

15 Wash Buffer (10 mM HEPES; 0.2 M NaCl; 1% Trition X-100). Supernatant 
was removed by aspiration, and 20 fil 2X SDS sample buffer containing 10% 
jS^mercaptoethanol was added. The sample was boiled for 2 minutes in a water 
bath, and the sample loaded onto a 5% SDS PAGE gel. Following separation, 
the proteins were transferred to nitrocellulose at constant current overnight. The 

20 nitrocellulose filters were blocked with 3% BSA in TBS-T for 1 hour at room 
temperature and the blocking buffer was removed. A 1:6000 dilution of 
Strepavidin-HRP conjugate (Jackson) in 0.1% BSA TBS-T was added and 
incubation continued for 30 minutes at room temperature. Filters were washed 
three times for 15 minutes each, with TBS-T and autoradiographed using 

25 Amersham's ECL kit according to manufacturer's suggested protocol. 
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Example 19 

Isolation of Mouse cDNA Clones 

Isolation of a mouse a d homolog was attempted. 
Cross-species hybridization was performed using two PCR- 
5 generated probes: a 1.5 kb fragment corresponding to bases 522 to 2047 from 
human clone 19A2 (SEQ ID NO: 1), and a 1.0 kb rat fragment which corresponds 
to bases 1900 to 2900 in human clone 19A2 (SEQ ID NO: 1). The human probe 
was generated by PCR using primer pairs designated ATM-2 and 9-10.1 set out 
in SEQ ID NOS: 38 and 39, respectively; the rat probe was generated using 
10 primer pairs 434L and 434R; set out in SEQ ID NOS: 34 and 35, respectively. 
Samples were incubated at 94°C for 4 minutes and subjected to 30 cycles of the 
temperature step sequence: 94 °C; 50°C 2 minutes; 72°C, 4 minutes. 

5 '-GTCCAAGCFGTCATGGGCCAG-3 ' (SEQ ID NO: 38) 

5 '-GTCCAGCAGACTGAAGAGCACGG-3 (SEQ ID NO: 39) 

15 The PCR products were purified using the Qiagen Quick Spin kit 

according to manufacturer's suggested protocol, and approximately 180 ng DNA 
was labeled with 200 /zCi [ 32 P]-dCTP using a Boehringer Mannheim Random 
Primer Labeling kit according to manufacturer's suggested protocol. 
Unincorporated isotope was removed using a Centri-sep Spin Column (Princeton 

20 Separations, Adelphia, NJ) according to manufacturer's suggested protocol. The 
probes were denatured with 0.2 N NaOH and neutralized with 0.4 M Tris-HCl, 
pH 8.0, before use. 

A mouse thymic oligo dT-priined cDNA library in lambda ZAP II 
(Stratagene) was plated at approximately 30,000 plaques per 15 cm plate. Plaque 

25 lifts on nitrocellulose filters (Schleicher & Schuell, Keene, NH) were incubated 
at 50°C with agitation for 1 hour in a prehybridization solution (8 ml/lift) 
containing 30% formamide. Labeled human and rat probes were added to the 
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prehybridization solution and incubation continued overnight at 50°C. Filters 
were washed twice in 2X SSC/0. 1 % at room temperature, once in 2X SSC/0. 1 % 
SDS at 37°C, and once in 2X ■ SSC/0. 1% SDS at 42°C. Filters were exposed on 
Kodak X-Omat AR film at -80 C C for 27 hours with an intensifying screen. 
5 Four plaques giving positive signals on duplicate lifts were 

restreaked on LB medium with magnesium (LBM)/carbenicillin (100 mg/ml) 
plates and incubated overnight at 37°C. The phage plaques were lifted with 
Hybond filters (Amersham), probed as in the initial screen, and exposed on Kodak 
X-Omat AR film for 24 hours at -80°C with an intensifying screen. 

10 Twelve plaques giving positive signals were transferred into low 

Mg ++ phage diluent containing 10 mM Tris-HCl and 1 mM MgCl 2 . Insert size 
was determined by PCR amplification using T3 and T7 primers (SEQ ID NOS: 
13 and 14, respectively) and the following reaction conditions. Samples were 
incubated at 94 P C for 4 minutes and subjected to 30 cycles of the temperature 

15 step sequence: 94 °C, for 15 seconds; 50°C, for 30 seconds; and 72 °C for 1 
minute. . 

Six samples produced distinct bands that ranged in size from 300 
bases to 1 kb. Phagemids were released via co-infection with helper phage and 
recircularized to generate Bluescript SK" (Stratagene). The resulting colonies 
20 . . were cultured in LBM/carbenicillin (100 mg/ml) overnight. DNA was isolated 
i with a Promega Wizard rniniprep kit (Madison, WI) according to manufacturer's 
suggested protocol. £coRI restriction analysis of purified DNA confirmed the 
molecular weights which were detected using PCR. Insert DNA was sequenced 
with M13 and M13 reverse. 1 < primers set out in SEQ ID NOS: 40 and 41, 
25 * respectively. 

5 '-TGTAAAACGACGGCCAGT-3 ' - (SEQ ID NO: 40) 

5 '-GGAAACAGCTATGACCATG-3 ' (SEQ ID NO: 41) 
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Sequencing was performed as described in Example 4. 
Of the six clones, only two, designated 10.3-1 and 10.5-2, provided 
sequence information and were identical 600 bp fragments. The 600 bp sequence 
was 68% identical to a corresponding region of human a d , 40% identical to 
5 human CDlla, 58% identical to human CDllc, and 54% identical to mouse 
CDllb. This 600 bp fragment was then utilized to isolate a more complete 
cDNA encoding a putative mouse a d homolog . 

A mouse splenic cDNA library (oligo dT and random-primed) in 
. lambda Zap II (Stratagene) was plated at 2.5 x 10 4 phage/15 cm LBM plate. 
0 Plaques were lifted on Hybond nylon transfer membranes (Amersham), denatured 
with 0.5 M NaOH/1.5 M NaCl, neutralized with 0.5 M Tris Base/1.5 M 
NaCl/1 L6 HC1, and washed in 2X SSC. The DNA was cross-linked to filters by 
ultraviolet irradiation, 

Approximately 500,000 plaques were screened using probes 10.3-1 
5 and 10.5-2 previously labeled as described supra. Probes were added to a 
prehybridization solution and incubated overnight at 50°C. The filters were 
washed twice in 2X SSC/0.1% SDS at room temperature, once in 2X SSC/0.1% 
SDS at 37°C, and once in 2X SSC/0. 1 % SDS at 42°C. Filters were exposed on 
Kodak X-Omat AR film for 24 hours at -80°C with an intensifying screen. 
) Fourteen plaques giving positive: signals on duplicate lifts were subjected to a 
secondary screen identical to that for the initial screen except for additional final 
high stringency washes in 2X SSC/ 0. 1 % SDS at 50°C, in 0.5X SSC/0. 1 % SDS 
at 50°C, and at 55°C in 0<2X SSC/0.1% SDS. The filters were exposed on 
Kodak X-Omat AR film at -80°C for 13 hours with an intensifying screen. 
» Eighteen positive plaques were transferred into low Mg ++ phage 

diluent and insert size determined by PCR amplification as described above. 
Seven of the samples gave single bands that ranged in size from 600 bp to 4 kb. 
EcoBI restriction analysis of purified DNA confirmed the sizes observed from 
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PGR and the DNA was sequenced with primers M13 and M13 reverse. 1 (SEQ ID 
NOS: 40 and 41, respectively). 

One clone designated B3800 contained a 4 kb insert which 
corresponded to a region 200 bases downstream of the 5 ' end of the human a d 
19A2 clone and includes 553 bases of a 3 ' untranslated region. Clone B3800 
showed 77% identity to a corresponding region of human a d , 44% identity to a 
corresponding region of human CDlla, 59% identity to a corresponding region 
of human CD1 1c, and 51% identity to a corresponding region of mouse CDllb. 
The second clone A 1 160 was a 1.2 kb insert which aligned to the 5 ' end of the 
coding region of human a d approximately 12 nucleic acids downstream of the 
initiating methionine. Clone A1160 showed 75% identity to a corresponding 
region of human o d , 46% identity to a corresponding region of human CDlla, 
62% identity to a corresponding region of human CDllc, and 66% identity to a 
corresponding region of mouse CDllb. 

Clone A1160, the fragment closer to the 5 ' end of human clone 
19 A2, is 1 160 bases in length, and shares a region of overlap with clone B3800 
starting at base 205 and continuing to base 1134. Clone A1160 has a 110-base 
insertion (bases 704-314 of clone A 1160) not present in the overlapping region of 
clone B3800. This insertion occurs at a probable exon-intron boundary [Fleming, 
et al, J.lmmmoL J5G, 480-490 (1993)] and was removed before subsequent 
ligation of clones A1160 and B3800. 

Rapid Amplification of 5 * cDNA End of the Putative Mouse C1npe 

RACE PCR [Frohman, "RACE: Rapid Amplification of cDNA 
En*." in PCR Protocols: A Guide to Methods and Application, innis, et al 
(eds.) pp. 28*38, Academic PressrNew York (1990)] was used to obtain missing 
5 ' sequences of the putative mouse a d clone, including 5 * untranslated sequence 
and initiating methionine. A mouse splenic RACE-Ready kit (Clontech, Palo 
Alto, CA) was used according to the manufacturer's suggested protocol. Two 
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antisense, gene-specific primers, A1160 RACE 1 -primary and A1160 RACE2- 
nested (SEQ ID NOS: 42 and 43), were designed to perform primary and nested 
PCR. 

5 '-GGACATGTTCACTGCCTCTAGG-3 ' (SEQ ID NO: 42) 

5 5 '-GGCGGACAGTCAGACGACTGTCCTG-3 ' (SEQ ID NO: 43) 

The primers, SEQ ID NOS: 42 and 43, correspond to regions 
starting 302 and 247 bases from the 5 * end, respectively. PCR was performed 
as described, supra, using the 5 ' anchor primer (SEQ ID NO: 44) and mouse 
spleen cDNA supplied with the kit. 

10 5 '-CTGGTTCGGCCCACCTCTGAAGGTTCCAGAATCGAT(fiB5BID NO: 44) 

Electrophoresis of the PCR product revealed a band approximately 280 bases in 
size, which was subdoned using a TA cloning kit (Invitrogen) according to 
manufacturer's suggested protocol. Ten resulting colonies were cultured, and the 
DNA isolated and sequenced. An additional 60 bases of 5 ' sequence were 
15 identified by this method, which correspond to bases 1 to 60 in SEQ ID NO: 45. 

Characteristics of the Mouse cDNA and Predicted Amino Acid Sequence 

A composite sequence of the mouse cDNA encoding a putative 
homolog of human a d is set out in SEQ ID NO: 45. Although homology between 
the external domains of the human and mouse clones is high, homology between 
20 the cytoplasmic domains is only 30%. ; The observed variation may indicate C- 
terminal functional differences between the human and mouse proteins. 
Alternatively, the variation in the cytoplasmic domains may result from splice 
variation, or may indicate the existence of an additional /S 2 integrin gene(s). 



BNSDOC1D: <WO 9517412A1J_> 



WO 95/17412 



PCT/US94/14832 



-62- 

At the amino acid level, the mouse cDNA predicts a protein (SEQ 
ID NO: 46) with 28% identity to mouse CDlla, 53% identity to mouse CDllb, 
28% identity to human CDlla, 55% identity to human CDllb, 59% identity to 
human CDllc, and 70% identity to human a d . Comparison of the amino acid 
5 sequences of the cytoplasmic domains of human oc d and the putative mouse 
homolog indicates regions of the same length, but having divergent primary 
structure. Similar sequence length in these regions suggests species variation 
rather than splice variant forms. When compared to the predicted rat polypeptide, 
Example 16, supra, mouse and rat cytoplasmic domains show greater than 60% 
10 identity. 

Example 20 

Isolation of additional mouse ad cDNA clones for sequence verification 

In order to verify the nucleic arid amino acids sequences describe 

in Example 19 for mouse a d , additional mouse sequences were isolated for the 
15 purposes of confirmation. 

Isolation of mouse cDNA by hybridization with two homologous 

a d probes (3 " and 5 ') was performed using both a mouse splenic random primed 

library and an oligo dT-primed cDNA library in lambda ZAP II (Strategene). 

The library was plated at 5 x 10 5 phage per 15 cm LBM plate. Plaques were 
20 lifted on Hybond nylon membranes (Amerstiam), and the membranes were 

denatured (0.5 M NaOH/1.5 M NaCl), neutralized (0.5 M Tris Base/1.5 M NaCl 

7 -11.6 M HC1) and washed (2X SSC salt solution). DNA was cross-lined to 

filters by ultraviolet irradiation. 

Probes were generated using primers described below in a PCR 
25 reaction under the following' conditions. Samples were held at 94°C for 4 

minutes and then run through 30 cycles of the temperature step sequence (94 °C 

for 15 seconds; 50°C for 30 seconds; 72°C for 1 minute in a Perkin-Elmer 9600 

thennocycler). 
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The 3 ' probe was approximately 900 bases long and spanned a 
region from nucleotides 2752 to 3651 (in SEQ ID NO: 1) (5 ' -* 3 ') and was 
produced with primers ll.b-l/2FORll and ll.b-l/2REV2 as shown in SEQ ID 
NOS: 69 and 74, respectively. This probe was used in a first set of lifts. 

The 5 ' probe was approximately 800 bases long and spanned a 
region from nucleotides 149 to 946 (in SEQ ID NO: 1) (5 ' -* 3 ') and was 
produced with primers ll.b-l/2FORl and Il.a-1/1REV1 as shown in SEQ ID 
NOS: 50 and 85, respectively). This probe was used in a second set of lifts. 

In a third set of lifts, both probes described above were used 
together on the same plates. 

Approximately 500,000 plaques were screened using the two probes 
from above which„ were labeled in the same way as described in Example 17. 
Labeled probes were, added to a prehybridization solution, containing 45% 
formamide, and incubated overnight at 50°C. Filters were washed twice in 2X 
SSC/0.1 % SDS at rpqm temperature (22°C). A final wash was carried out in 2X 
SSC/0.1 % SDS at 50°C. Autoradiography was for 19 hours at -80°C on Kodak 
X-Omat AR film with an intensifying screen. 

Thirteen plaques giving positive signals on at least duplicate lifts 
were subjected to a secondary screen performed as described for the initial screen 
except that both the 3 ' and 5 ' labeled probes were used for hybridization and an 
additional final wash was incorporated using 2X SSC/0.1 % SDS at 65 'C. 
Autoradiography was performed as described above for 2.5 hours. 

Thirteen plaques (designated MS2P1 through MS2P13) giving 
positive signals were transferred into low Mg ++ phage diluent. Insert size was 
determined by PGR amplification (Perkin-Elmer 9600 thermocycler) using T3 and 
T7 primers which anneal to Bluescript phagemid in ZAP II (sequence previously 
described) under the same conditions shown above. Band sizes ranged from 500 
bases to 4Kb. Phagemids were isolated, prepared, and sequenced with M13 and 
M13 reverse. 1 primers (SEQ ID NOS: 40 and 41, respectively). Five of the 
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thirteen clones; MS2P-3, MS2P-6, MS2P-9, MS2P-12, and MS2P-13, were 
sequenced, and together, represented a region from approximately base 200 at the 
5 ' end to about 300 bases past a first stop codon at the 3 ' end. 

Automated sequencing was performed as described in Example 4 
by first using M13 and M13 reverse. 1 primers (SEQ ID NOS: 40 and 41, 
respectively) to sequence the ends of each clone and to determine its position 
relative to construct #17 (SEQ ID NO: 45). Each clone was then completely 
sequenced using the appropriate primers (listed below) for that particular region. 



li b- 1/2FOR1 5 '-GCAGCCAGCTTCGGACAGAC-3 ' (SEQ ID NO: 50) 

Il.a-1/1F0R2 5 '-CCGCCTGCCACTGGCGTGTGC-3 ' (SEQ ID NO: 60) 

I l.a-l/lFOR3 5 '-CCCAGATGAAGGACTTCGTCAA-3 ' (SEQ ID NO: 61) 

I I .b-l/2FOR4 5 '-GCTGGGATCATTCGCTATGC-3 ' (SEQ ID NO: 62) 
1 l.b-l/2FOR5 5 '-CAATGGATGGACCAGTTCTGG-3 ' (SEQ ID NO: 63) 

I l.b-l/2FOR6 5 '-CAGATCGGCTCCTACTTTGG-3 ' (SEQ ID NO: 64) 

I I .b-l/2FOR7 5 -CATGGAGCCTCGAGACAGG-3 ' (SEQ ID NO: 65) 

1 1 .b-l/2FOR8 5 ':CCACTGTCCTCGAAGCTGGAG-3 ' (SEQ ID NO: 66) 

ll.b-l/2FOR9 5 '-CTTCGTCCTGTGCTGGCTGTGGGCTC-3 

(SEQ ID NO: 67) 

1 1 .b-l/2FOR10 5 '-CGCCTGGCATGTGAGGCTGAG-3 ' (SEQ ID NO: 68) 
1 1 .b-l/2FORl 1 5 '-CCGTGAtCAGTAGGCAGGAAG-3 ' (SEQ ID NO: 69) 
11.0-1/2F0R12 5 -GTCACAGAGGGAACCTCC-3' (SEQ ID NO: 70) 
' 1 1 .b-l/2FOR13 5 '-GCTCCTGAGTGAGCTCTGAAATCA-3 '(SEQ ID NO: 71) 

I l.b-l/2FOR14 5 ' -G AGATGCTGG ATCTACCATCTGC-3 ' (SEQ ID NO: 72) 

I I .b-l/2FOR15 5 '-CTGAGCTGGGAGATTTTTATGG-3 ' (SEQ ID NO: 73) 
1 1 .b-l/2REV2 5 '-GTGGATCAGCACTGAAATCTG-3 ' (SEQ ID NO: 74) 
1 1 .b-l/2REV3 5 '-CGTTTGAAGAAGCCAAGCTTG-3 ' (SEQ ID NO: 75) 
1 1 .b-l/2REV4 5 '-CACAGCGGAGGTGCAGGCAG-3 ' (SEQ ID NO: 76) 
1 1 .b-l/2REV5 5 '-CTCACTGCTTGCGCTGGC-3 ' (SEQ ID NO: 77) 
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ll.b-l/2REV6 


5' 


-CGGTAAGATAGCTCTGCTGG-3 ' 


(SEQ 


ID 


NO: 


78) 


ll.b-l/2REV7 


5' 


-GAGCCCACAGCCAGCACAGG-3 " 


(SEQ 


ID 


NO: 


79) 


ll.b-l/2REV8 


5' 


-GATCCAACGCCAGATCATACC-3 ' 


(SEQ 


ID 


NO: 


80) 


ll.b-l/2REV9 


5' 


-CACGGCCAGGTCCACCAGGC-3 ' 


(SEQ 


ID 


NO: 


81) 


ll.b-l/2REV10 


5' 


-CACGTCCCCTAGCACTGTCAG-3 ' 


(SEQ 


ID 


NO: 


82) 


ll.b-l/2REVll 


5' 


-CCATGTCCACAGAACAGAGAG-3 ' 


(SEQ 


ID 


NO: 


51) 


ll.b-l/2REV12 


5' 


-TTGACGAAGTCCTTCATCTGGG-3 ' 


(SEQ 


ID 


NO: 


83) 


ll.b-l/2REV13 


5' 


-GAACTGCAAGCTGGAGCCCAG-3 ' 


(SEQ 


ID 


NO: 


84) 


Il.a-1/1REV1 


5' 


-CTGGATGCTGCGAAGTGCTAC-3 ' 


(SEQ 


ID 


NO: 


85) 


Il.a-1/1REV2 


5' 


-GCCTTGGAGCTGGACGATGGC-3 ' 


(SEQ 


ID 


NO: 


86) 



Sequences were edited, aligned, and compared to a previously isolated mouse at 4 
sequence (construct #17, SEQ ID NO: 45). 

Alignment of the new sequences revealed an 18 base deletion in 
construct #17 beginning at nucleotide 2308; the deletion did not cause a shift in 

15 the reading frame! Clone MS2P-9, sequenced as described above, also revealed 
the same 18 base; deletion. The deletion has been observed to occur in 50% of 
mouse clones that include the region but has hot been detected in rat or human cr d 
clones. The eighteen base deletion is characterized by a 12 base palindromic 
sequence AAGCAGGAGCTCCTGTGT (SEQ ID NO: 91). This inverted repeat 

20 in the nucleic acid sequence is self-complementary and may form a loop out, 
causing cleavage during reverse transcription. The mouse a d sequence which 
includes the. additional 18 bases is set forth in SEQ ID NO: 52; the deduced 
amino scid sequence is set forth in SEQ ID NO: 53. 
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Example 21 

In situ hybridizations in Mouse 

Tissue distribution was then determined for mouse a d in order to 
provide a comparison to that in humans, described in Example 6. 
5 A single stranded 200 bp mRNA probe was generated from a DNA 

template, corresponding to nucleotides 3460 to 3707 in the cytoplasmic tail region 
of the murine cDNA, by in vitro RNA transcription incorporating 35 S-UTP 
(Amersham). 

Whole mouse embryos (harvested at days 11-18 after fertilization) 
10 and various mouse tissues, including spleen, kidney, liver, intestine, and thymus, 
were hybridized in situ with the radiolabeled single-skanded mRNA probe. 

Tissues were sectioned at 6 ^n: thickness, adhered to Vectabond 
(Vector Laboratories, Inc., Burlingame, CA) coated slides, and stored at -70°C. 
Prior to use, slides were removed from -70°C and placed at 50°C for 
15 approximately 5 minutes. Sections were fixed in 4% paraformaldehyde for 20 
minutes at 4°C, dehydrated with an increasing ethanol gradient (70-95-100%) for 

1 minute at 4°C at each concentration, and air dried for 30 minutes at room 
temperature. Sections were denatured for 2 minutes at 70°C in 70% 
formamide/2X SSC, rinsed twice in 2X SSC, dehydrated with the ethanol gradient 

20 described supra and air dried for 30 minutes. Hybridization was carried out 
overnight (12-16 hours) at 55°C in a solution containing 35 S-labeled riboprobes 
at 6 x 10 s cpm/section and diethylpyrocarbonate (DEPC)-treated water to give a 
final concentration of 50% formamide, 0.3 M NaCl, 20 mM Tris-HCl, pH 7.5, 
10% dextran sulfate, IX Denhardt's solution, 100 mM dithiothreitol (DTT) and 

25 5 mM EDTA. After hybridization, sections were washed for 1 hour at room 
temperature in 4X SSC/10 mM DTT, 40 minutes at 60°C in 50% formamide/2X 
SSC/10 mM DTT, 30 minutes at room temperature in 2X SSC, and 30 minutes 
at ropm temperature in 0.1X SSG. The sections were dehydrated, air dried for 

2 hours, coated with Kodak NTB2 photographic emulsion, air dried for 2 hours, 
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developed (after storage at 4°C in complete darkness) and counterstained with 
hematoxylin/eosin . 

Spleen tissue showed a strong signal primarily in the red pulp. This 
pattern is consistent with that of tissue macrophage distribution in the spleen, but 
5 does not exclude other cell types. 

Example 22 
Generation of Mouse Expression Constructs 

In order to construct an expression plasmid including mouse cDNA 
sequences exhibiting homology to human a d , inserts from clones A1160 and 

10 B3800 were ligatedi Prior to this ligation, however, a 5' leader sequence, 
including an ; ihitiatihg ^rifethibiiine, was added to clone A1160. A primer 
designated n 5' PGR leader" (SEQ ID NO: 47) was designed to contain: (1) 
identical nonspecific bases at positions 1-6 allowing for digestion; (2) a BamHI 
site (underlined in SEQ H> NO: 47) from positions 7-12 to facilitate subcloning 

15 into an expression vector; (3) a consensus Kozak sequence from positions 13-18, 
(4) a sighal sequence including a codon for an initiating methionine (bold in SEQ 
ID NO: 47), Hand (5) an additional 31 bases Of specifically overlapping 5' 
sequence from clone A 1160 to allow primer annealing. A second primer 
designated *3' end frag" (SEQ ID NO: 48) was used with primer w 5' PGR 

20 leader" to amplify the insert from clone A1160. 

5 '-AGTTAGGGATC£GGCACCATGAC- 

-CTTCGGCAGTGTGATCCTCCTGTGTG-3' (SEQ ID NO: 47) 

5 '-GCTGGACGATGGCATCCAC-3 ' (SEQ ID NO: 48) 

The resulting PGR product did not digest with BamHI, suggesting 
25 that an insufficient number of bases preceded the restriction site, prohibiting 
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recognition by the enzyme. The length of the "tail" sequence preceding the 
BamHl site in the 5 * primer (SEQ ID NO: 47) was increased and PCR was 
repeated on the amplification product from the first PCR. A 5' primer, 
designated mAD.5 \2 (SEQ ID NO: 49), was designed with additional nonspecific 
bases at positions 1-4 and an additional 20 bases specifically overlapping the 
previously employed "5 ' PCR leader" primer sequences. 

5 '-GTAGAGTTACGGATCCGGCACCAT-3 ' (SEQ ID NO: 49) 

Primers "mAD.5 \T and "3 ' end frag" were used together in PCR 
with the product from the ffrst amplification as template. A resulting secondary 
PCR product was subcloned into plasmid pCRtmll (Invitrogen) according to 
manufacturer's suggested protocol and transformed into competent One shot cells 
(Invitrogen). One clone containing the PCR product was identified by restriction 
enzyme analysis using BamHl and EcoBI and sequenced. After the sequence was 
verified, the insert was isolated by digestion with BamHl and EcdSI and gel 
purified. 

The insert from clone B3800 was isolated by digestion with EcoBl 
and Afo/I, gel purified, and added to a ligation reaction which included the 
augmented Al 160 BamHVEcdKL fragment. Ligation was allowed to proceed for 
14 hours at 14°C. Vector pcDNA.3 (Invitrogen), digested with BamHl and Ato/I, 
was added to the ligation reaction with additional ligase and the reaction was 
continued for another 12 hours. An aliquot of the reaction mixture was 
transformed into competent E. coli cells, the resulting colonies cultured, and one 
positive clone identified by PCR analysis with the primers ll.b-l/2FORl and 
ll.b-l/2REVll (SEQ ID NOS: 50 and 51, respectively). These primers bridge 
the A1160 and B3800 fragments, therefore detection of an amplification product 
indicates the two fragments were ligated. The sequence of the positive clone was 
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thyrnidine kinase encoding cassettes. Further analysis of this clone with an I 
domain probe (corresponding to nucleotides 454-1064 in SEQ ID NO: 45) 
indicated that the clone did not contain I domain encoding sequences. 

Using the same I domain probe, the XFKII genomic library was 
5 rescreened. Initially, six positive clones were detected, one of which remained 
positive upon secondary screening. DNA isolated from this clone reacted strongly 
in Southern analysis with an I domain probe. No reactivity was detected using 
the original 750 bp probe, however, indicating that this clone included regions 5 ' 
to nucleotides 1985-2773 of SEQ ID NO: 45.. 

10 Alternatively, the lack of hybridization to the 750 bp probe may 

have suggested that the clone was another member of the integrin family of 
proteins. To determine if this explanation was plausible, the 13 kb insert was 
subcloned into pBluescript SKH + . Purified DNA was sequenced using primers 
corresponding to a d I domain nucleic acid sequences 441-461, 591-612, 717-739, 

15 and reverse 898-918 in SEQ ID NO: 52. Sequence information was obtained 
using only the first 4441-4461 primer, and only the 5 '-most exon of the I domain 
was efficiently amplified. The remainder of the I domain was not amplified. The 
resulting clone therefore comprised exon 6 of the mouse a d gene, and intronic 
sequences to the 3 ' and 5 * end of the exon, Exon 7 was not represented in the 

20 clone. After sequencing, a construct is generated containing neomycin resistance 
and thymidine kinase genes. 

The neomycin resistance (neo 1 ) gene is inserted into the resulting 
plasmid in a manner that interrupts the protein coding sequence of the genomic 
mouse DNA. The resulting plasmid therefore contains a neo r gene within the 

25 mouse genomic DNA sequences, all of which are positioned within a thymidine 
kinase encoding region. Plasmid construction in this manner is required to favor 
homologous recombination over random recombination [Chisaka, et al. , Nature 
* 555:516-520 (1992)]. 
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verified with the primers set out in SEQ ID NOS: 50 and 51, which amplify from 
base 100 to 1405 after the initiating methionine. 

Example 23 
Construction of a Knock-out Mouse 
5 In order to more accurately assess the immunological role of the 

protein encoded- by the putative mouse ot d cDNA, a "knock-out" mouse is 
designed wherein the genomic DNA sequence encoding the putative a d homolog 
is disrupted by homologous recombination. The significance of the protein 
encoded by the disrupted gene is thereby assessed by the absence of the encoded 

10 protein. Generation of "knock-out" mice is described in Deng, et aL, 
MohCelhBioL 23:2134-2140 (1993). 

Design of such a mouse begins with construction of a plasmid 
containing sequences to be "knocked out* by homologous recombination events. 
A 750 base pair fragment of the mouse cDNA (corresponding to nucleotides 1985 

15 to 2733 in SEQ ID NO: 45) was used to identify a mouse genomic sequence 
encoding the putative mouse ot d homolog from a XFIXII genomic library. 
Primary screening resulted in 14 positive plaques, seven of which were confirmed 
by secondary screening. Liquid iysates were obtained from two of the plaques 
giving the strongest signal and ihe X DNA was isolated by conventional methods. 

20 Restriction mapping and Southern analysis confirmed the authenticity of one 

clone, designated 14-1, and the insert DNA was isolated by digestion with Noil. 
This fragment was cloned into Bluescript SKII + . 

In order to identify a restriction fragment of approximately 9 to 14 
kb, a length reported to optimize the probability of homologous recombination 

25 events, Southern hybridization was performed with the 750 bp cDNA probe. 
Prior to hybridization, a restriction map was constructed for clone 14-1. A 12 kb 
fragment was identified as a possible candidate and this fragment was subcloned 
into pBluescript SKH + in a position wherein the mouse DNA is flanked by 
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10 6 dpm/ml. ' Hybridization was. carried out at 42°C for 16-18 hours. Filters 
were washed extensively in 2X SSPE/0. 1 % SDS at room temperature and exposed 
to X-ray film to visualize any hybridizing plaques. 

Two clones with significant sequence homology to human a d were 
5 identified. Clone #2 was approximately 800 bp in length and mapped to the 5 ' 
end of human a d . Clone #2 includes an initiating methionine and complete leader 
sequence. Clone #7 was approximately 1.5 kb and includes an initiating 
methionine. The 5 ' end of clone #7 overlapped that Of clone #2, while the 3 ' 
sequences terminated at a point beyond the I domain sequences. Internal 

10 sequencing of clone #7 is performed using the nested deletions sequencing 
technique. ; o 

The predicted N terminal amino acid sequence for rabbit a d as 
determined from clones #2 and #7 indicated a protein with 73% identity with 
human cr d , 65% identity with mouse a d , and 58% identity with mouse CDllb, 

15 human CDllb, and human CDllc. The nucleic acid sequence fordone #2 is set 
out in SEQ ID NO: 92; the predicted amino acid sequence is set out in SEQ ID 
NO: 93 

Isolation of a full length rabbit a d cDNA is carried out using 
labeled rabbit fragment, clpne # 7, and rescreemng the cDNA library from which 
20 . the fragment was derived. 

Isolation of a rabbit a d clone allows expression of the protein, 
either on the surface of transfectants or as a soluble full length or truncated form. 
This protein is then used as an immunogen for the production of monoclonal 
antibodies for use in rabbit models cf human disease states. 
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Ex ample 24 

Cloning of Rabbit a d - Construction and Screening of the Rabbit cDNA Library 
Identification of human a d homologs in rats and mice led to the 
investigation of the existence of a rabbit homolog which would be useful in rabbit 
5 modeis of human disease states described infra. 

Poly A + RNA was prepared from a whole rabbit spleen using an 
Invitrogen FastTrack kit (San Diego, CA) according to manufacturer's suggested 
protocol and reagents supplied with the kit. From 1.65 g tissue, 73 \x% poly 
A + RNA were isolated. The rabbit spleen RNA was used to construct a ZAP 
10 Express cDNA library using a kit from Stratagene (La Jolla, CA). Resulting 
cDNA was directionaliy cloned into EcoRI and Xhol sites in the lambda arms of 
a pBK-CMV phagemid .vector. Gigapack II Gold (Stratagene) was used to 
package the lambda 2rms into phage particles. The resulting library titer was 
estimated to be approximately 8 x 10 5 particles, with an average insert size of 1.2 
15 kb. 

The library was; amplified once by plating for confluent plaque 
growth , and cell lysate was collected. The amplified library was plated at 
approximately 30,000 plaque forming units (pfu) per 150 mm plate with E. coli 
and the resulting mixture incubated for 12-16 hrs at 37°C to allow plaque 

20 formation. Phage DNA was transferred onto Hybond N + nylon membranes 
(Amersham, Arlington Heights, Illinois). The membranes were hybridized with 
a mixture of two random primed radiolabeled mouse a d PCR DNA probes. The 
first probe was generated from a PCR product spanning nucleotides 149-946 in 
SEQ ID NO: 52. The second probe was from a PCR product spanning 

25 nucleotides 2752-3651 in SEQ ID NO: 52. Probes were labeled by random 
priming (Boehringer Mannheim Random Primed DNA Labeling Kit) and the 
reaction mixture was passed over a Sephadex G-50 column to remove 
- unincorporated nucleotides. The hybridization solution was composed of 5X 
SSPE, 5X Denhardts, 1% SDS, 40% Formamide and the labeled probes at 1 x 
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functions which damage normal host tissue through either specific autoimmune 
responses or as a result of bystander cell damage. 

Disease states in which there is evidence of macrophages playing 
a significant role in the disease process include multiple sclerosis, arthritis, graft 
5 atherosclerosis, some forms of diabetes and inflammatory bowel disease. Animal 
models, discussed below, have been shown to reproduce many of the aspects of 
these human disorders. Inhibitors of a d function are tested in these model 
systems to determine if the potential exists for treating the corresponding human 
diseases. 

10 A. Graft Arteriosclerosis 

Cardiac transplantation is now. the accepted form of therapeutic 
intervention for some types of end-state heart disease. As the use of cyclosporin 
A has increased one year survival rates to 80%, the development of progressive 
graft arteriosclerosis has emerged as the leading cause of death in cardiac 

15 transplants surviving beyond the first year. Recent studies have found that the 
incidence of significant graft arteriosclerosis 3 years following a cardiac transplant 
is in the range of 36-44% [Adams, et al.. Transplantation 55:1115-1119 (1992); 
Adams, et al ; Tramplamation 56:794-799 (1993)]. 

Graft arteriosclerosis typically consists of diffuse, occlusive , intimal 

20 lesions which affect the entire coronary vessel wall y and are often accompanied 
by lipid deposition. While the pathogenesis of graft arteriosclerosis remains 
unknown,. it is presumably linked to histocompatibility differences between donor 
and recipient, and is immunologic in nature. Histologically, the areas of intimal 
thickening are composed primarily of macrophages, although T cells are 

25 occasionally seen. It is therefore possible that macrophages expressing ct d may 
play a significant role in the induction and/or development of graft 
. arteriosclerosis, In such a- case, monoclonal antibodies or small molecule 
inhibitors (for example, soluble ICAM-R) of. .aj function could be given 
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Example 25 

Animal Mod els For Determining a d Therapeutic Utility 

Immunohistologic data in dog and in situ hybridization in rats and 
mice has determined that in spleen a d is expressed primarily by macrophages 
5 present in red pulp and in lymph nodes, ofj is found in medullary cords and 
sinuses. The expression pattern is remarkably similar to what has been reported 
for two murine antigens defined by the monoclonal antibodies F4/80 and SK39. 
While biochemical characterization of these murine antigens has demonstrated that 
they are distinct from a d , it is highly . probably that a d defines the same 

10 macrophage subset as the murine F4/80 and SK39 antigens. 

In mouse. SK39- positive macrophages have been identified in 
splenic red pulp where they may participate, in the . clearance of foreign materials 
from circulation, and in medulla of lymph nodes [Jutila, et al. , J, Leukocyte Biol 
54:30-39 (1993)]. SX39-positive macrophages have also been reported at sites of 

15 both acute and chronic inflammation. Furthermore, monocytes recruited to 
thioglycolate-inflamed peritoneal cavities also express the SK39 antigen. 
Collectively, these findings suggest that, if SK39 + cells are also a d + , then these 
cells are responsible for the clearance of foreign materials in the spleen and 
participate in inflammation where macrophages play a significant role. 

20 While the function of a d remains unclear, other more well 

characterized #2 integriris have been shown to participate in a wide variety of 
adhesion events that facilitate cell migration, enhance phagocytosis, and promote 
cell-cell interactions, events which all lead to upregulation of inflammatory 
processes. Therefore, it is highly plausible that interfering with the normal a d 

25 function may also interfere with inflammation where macrophages play a 
significant role. Such an anti-inflammatory effect could result from: i) blocking 
- macrophage recruitment to sites of inflammation, ii) preventing macrophage 
activation at the site of inflammation or iii) interfering with macrophage effector 
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lumenal surface of the ascending aorta [Rosenfeld, et al. f Arteriosclerosis 7:9-23 
(1987); Rosenfeld, et al , Arteriosclerosis 7:24-34 (1987)]. The atherosclerotic 
lesions seen in these rabbits are simmer to those in humans. Lesions contain 
large numbers of T cells, most of which express CD45RO, a marker associated 
5 with memory T cells. Approximately half of the infiltrating T cells also express 
MHC class H antigen and some express the IL-2 receptor suggesting that many 
of the cells are in an activated state. 

One feature of the atherosclerotic lesions found in cholesterol fed 
rabbits, but apparently absent in rodent models, is the accumulation of foam cell- 

10 rich lesions. Foam cell macrophages are believed to result from the uptake of 
oxidized low-density lipoprotein (LDL) by specific receptors. Oxidized LDL 
particles have been found to be toxic for some cell types including endothelial 
cells and smooth muscle cells. The uptake of potentially toxic, oxidized LDL 
particles by macrophages serves as an irritant and drives macrophage activation, 

15 contributing to the inflammation associated with atherosclerotic lesions. 

Once monoclonal antibodies have been generated to rabbit a d , 
cholesterol fed rabbits are treated. Treatments include prophylactic administration 
of a& monoclonal antibodies or small molecule inhibitors, to demonstrate that 
a d + macrophages are involved in the disease process. Additional studies would 

20 demonstrate that monoclonal antibodies to a d or small molecule inhibitors are 
capable of reversing vessel damage detected in rabbits fed an atherogenic diet. 

C. Insulin-dependent Diabetes 

BB rats spontaneously develop insulin-dependent diabetes at 70- 1 50 
days of age: Using immunohistochemistry, MHC class II + , EDI + macrophages 
25 can be detected infiltrating the islets early in the disease. Many of the 
macrophages appear to be engaged in phagocytosis of cell debris or normal cells. 
As the disease progresses, larger numbers of macrophages are found infiltrating 
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prophylactically to individuals who received heart transplants and are at risk of 
developing graft arteriosclerosis. 

Although atherosclerosis in heart transplants presents the greatest 
threat to life, graft arteriosclerosis is also seen in other solid organ transplants, 
5 including kidneys and livers. Therapeutic use of a d blocking agents could prevent 
graft arteriosclerosis in other organ transplants and reduce complications resulting 
from graft failure. 

One model for graft arteriosclerosis in the rat involves heterotopic 
cardiac allografts .transplanted across minor histocompatibility barriers. When 
10 Lewis cardiac allografts are transplanted into MHC class I and II compatible F- 
344 recipients, 80% of the allografts survive at least 3 weeks, while 25% of the 
; grafts survive indefinitely. During this low-grade graft rejection, arteriosclerosis 
lesions form in the donor heart. Arterial lesions in 120 day old allografts 
typically have diffuse fibrotic intimal thickening indistinguishable in appearance 
15 from graft arteriosclerosis lesions found in rejecting human cardiac allografts. 
Rats are transplanted with hearts mismatched at minor 
histocompatibility antigens, for example Lewis into F-344. Monoclonal antibodies 
specific for rat or d or small molecule inhibitors of a d are given periodically to 
... transplant recipients. Treatment is expected to reduce the incidence of graft 
20 arteriosclerosis in non-rejecting donor hearts. Treatment of rats with a d 
: monoclonal antibodies or small molecule inhibitors may not be limited to 
prophylactic treatments. Blocking a d function is also be expected to reduce 
macrophage mediated inflammation and allow reversal of arterial damage in the 
graft. 

25 B. Atherosclerosis in Rabbits Fed Cholesterol 

< • Rabbits fed an atherogenic diet containing a cholesterol supplement 
for approximately 12-16 weeks develop intimal lesions that cover most of the 
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the islets, although significant numbers of T cells, and later B cells, also appear 
to be recruited to the site [Hanenberg, et al, Diabetologia 32: 126-134 (1989)]. 

Development of diabetes in BB rats appears to depend on both early 
macrophage infiltration and subsequent T cells recruitment. Treatment of BB rats 
5 with silica particles, which are toxic to macrophages, has been effective in 
blocking the early macrophage infiltration of the islets. In the absence of early 
macrophage infiltration, subsequent tissue damage by an autoaggressive 
lymphocyte population fails to occur. Administration of monoclonal antibody 
OX- 19 (specific for rat CD5) or monoclonal antibody OX-8 (specific for rat 

10 CD8), which block the T cell-associated phase of the disease, is also effective in 
suppressing the development of diabetes. 

The central role of macrophages in the pathology of this model 
makes it attractive for testing inhibitors of ot d function. Rats genetically 
predisposed to the development of insulin-dependent diabetes are treated with 

15 monoclonal antibodies to a d or small molecule inhibitors and evaluated for the 
development of the disease. Preventing or delaying clinical onset is evidence that 
a d plays a pivotal role ixi macrophage damage to the islet cells. 

D. Inflammatory Bowel Disease (Crohn's Disease, Ulcerative 

Colitis) 

20 Animal models used in the study of inflammatory bowel disease 

(IBD) are generally elicited by intrarectal administration of noxious irritants (e.g. 
acetic acid or trinitrobenzene sulfonic acid/ethanol). Colonic inflammation 
induced by these agents is the result of chemical or metabolic injury and lacks the 
chronic and spontaneously relapsing inflammation associated with human IBD. 

25 However, a recently described model using subserosal injections of purified 
peptidoglycan-polysacchaiide (PG-PS) polymers from either group A or group D 
streptococci appears to be a more physiologically relevant model for human IBD 
[Yamada, et al, Gastroemerolgy 104:159-111 (1993)]. 
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In this mode] PG-PS is injected into the subserosal layer of the 
distal colon. The resulting inflammatory response is biphasic with an initial acute 
episode three days after injection, which is followed by a spontaneous chronic 
phase three to four weeks later. The late phase response is granulomatous in 
5 nature, and results in colonic thickening, adhesions, colonic nodules and mucosal 
lesions. In addition to mucosal injury, PG-PS colitis frequently leads to arthritis 
anemia and granulomatous hepatitis. The extraintestinal manifestations of the 
disease make the model attractive for studying Crohn's colitis in that a significant 
number of patients with active Crohn's disease suffer from arthritic joint disease 

10 and hepatobiliary inflammation. 

Granulomatous lesions are the result of chronic inflammation which 
leads to the recruitment and subsequent activation of cells of the 
monocyte/macrophage lineage. Presence of granulomatous lesions in Crohn's 
disease and the above animal model make this an attractive clinical target for a d 

15 monoclonal antibodies or other inhibitors of ct d function. Inhibitors of a d 
function are expected to block the formation of lesions associated with IBD or 
even reverse tissue damage seen in the disease. 

E. Arthritis 

Arthritis appears to be a multi-factorial disease process involving 
20 a variety of infhmmatory cell types including neutrophils, T lymphocytes and 
phagocytic macrophages. Although a variety . of arthritis models exist, 
preparations of streptococcal cell wall proteoglycan produce a disorder most 
similar to the human disease. 

In rats, streptococcal cell wall induces inflammation of peripheral 
25 joints characterized by repeated episodes of disease progression followed by 
remission and eventually resulting in joint destruction over a period of several 
. months [Cromartie, et al, J.Exp.Med. 245:1585-1602 (1977); Schwab et al 9 
Infection and Immunity JP:4436-4442 (1991)]. During the chronic phase of the 
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disease, mononuclear phagocytes or macrophages are believed to play a major 
role in destruction of the synovium. Furthermore, agents which suppress the 
recruitment of macrophages into the synovium effectively reduce the inflammation 
and pathology characteristic of arthritis. 
5 A central role for the macrophage in synovium destruction that 

leads to arthritis predicts that monoclonal antibodies to a d or inhibitors of a d 
function may have therapeutic potential in the treatment of this disease. As in 
other models previously described, c* d monoclonal antibodies or small molecule 
inhibitors administered prbphylactically are expected to block or moderate joint 
10 inflammation and prevent destruction of the synovium. Agents that interfere with 
a& function may also moderate ongoing inflammation by preventing the 
recruitment of additional i macrophages to the joint or blocking macrophage 
activation : The net result would be to reverse ongoing destruction of the joint and 
facilitate tissue- repair. 

15 F. Multiple Sclerosis 

Although pathogenesis of multiple sclerosis (MS) remains unclear, 
it is generally accepted that the disease is mediated by CD4 + T cells which 
recognize autoantigens in the central nervous system and initiate an inflammatory 
cascade. The resulting immune response results in the recruitment of additional 

20 inflammatory cells, including activated macrophages which contribute to the 
disease.- Experimental autoimmune encephalomyelitis (EAE) is an animal model 
which reproduces some aspects of MS. Recently, monoclonal antibodies reactive 
with CDllb/CD18 [Huitinga, et aL, Eur.J.lrtvnunoL 23:709-715 (1993)] present 
on inflammatory macrophages have been shown to block both clinical and 

25. histologic disease. The results suggest that monoclonal antibodies or small 
molecule inhibitors to a d are likely to be effective in blocking the inflammatory 
response in EAE. - Such agents also have important therapeutic applications in the 
treatment of MS. 
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Numerous modifications and variations in the invention as set forth 
in the above illustrative examples are expected to occur to those skilled in the art. 
Consequently only such limitations as appear in the appended claims should be 
placed on the invention. 
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(ix) FEATURE: 

(A) NAME/KEY: CDS . 

(B) LOCATION: 3. ,3485 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:* 

TG ACC TTC GGC ACT GTC CTT CTT CTG AGT GTC CTG GCT TCT TAT CAT 47 

Thr Phe Gly Thr Val Leu Leu Leu Ser Val Leu Ala Ser Tyr His 

15 10 15 

GGA TTC AAC CTG GAT GTG GAG GAG CCT ACG ATC TTC CAG GAG GAT GCA 95 

Gly Phe Asn Leu Asp Val Glu Glu Pro Thr lie Phe Gin Glu Asp Ala 

20 25 30 

GGC GGC TTT GGG CAG AGC GTG GTG CAG TTC GGT GGA TCT CGA CTC GTG 143 

Gly Gly Phe Gly Gin Ser Val Val Gin Phe Gly Gly Ser Arg Leu Val 

35 40 .45 

GTG GGA GCA CCC CTG GAG GTG GTG GCG GCC AAC CAG ACG GGA CGG CTG 191 

Val Gly Ala Pro Leu Glu Val Val Ala Ala Asn Gin Thr Gly Arg Leu 

50 55 60 

TAT GAC TGC GCA GCT GCC ACC GGC ATG TGC CAG CCC ATC CCG CTG CAC 239 

Tyr Asp Cys Ala Ala Ala Thr Gly Met Cys Gin Pro lie Pro Leu His 
65 70 75 

ATC CGC CCT GAG GCC GTG AAC ATG TCC TTG GGC CTG ACC CTG GCA GCC 287 

lie Arg Pro Glu Ala Val Asn Met: Ser Leu Gly Leu Thr Leu Ala Ala 

80 85 .90 95 

TCC ACC AAC GGC TCC CGG CTC CTG GCC TGT GGC CCG ACC CTG CAC AGA 335 

Ser Thr Asn Gly Ser Arg Leu Leu Ala Cys Gly Pro Thr Leu His Arg 

100 105 110 

GTC " TGT GGG GAG AAC TCA TAC TCA AAG GGT TCC TGC CTC CTG CTG GGC 383 

Val Cys Gly Glu Asn Ser Tyr Ser Lys Gly Ser Cys Leu Leu Leu Gly 

115 ;120 " 125 

TCG CGC TGG GAG ATC ATC CAG ACA GTC CCC GAC GCC ACG CCA GAG TGT 431 

Ser Arg Trp Glu lie lie Gin Thr Val Pro Asp Ala Thr Pro Glu Cys 

130 135 140 

CCA CAT CAA GAG ATG GAC ATC GTC TTC CTG ATT GAC GGC TCT GGA AGC 479 

Pro His Gin Glu Met Asp He Val Phe Leu He Asp Gly Ser Gly Ser 
145 • 150 155 

ATT GAC CAA AAT GAC TTT AAC CAG ATG AAG GGC TTT GTC CAA GCT GTC 527 

He Asp Gin Asn Asp Phe Asn Gin Met Lys Gly Phe Val Gin Ala Val 

160 165 : .170 . 175 

ATG GGC CAG TTT GAG GGC ACT GAC ACC CTG TTT GCA CTG ATG CAG TAC 575 

Met Gly Gin Phe Glu Gly Thr Asp Thr Leu Phe Ala Leu Met Gin Tyr 

180 ; .185 . ; 190 

TCA AAC CTC CTG AAG ATC CAC TTC ACC TTC ACC CAA TTC CGG ACC AGC 623 

Ser Asn Leu Leu Lys He His Phe Thr Phe Thr Gin Phe Arg Thr Ser 

195 200 ■ . 205 

CCG AGC CAG CAG AGC CTG GTG GAT CCC ATC GTC CAA CTG AAA GGC CTG 671 

Pro Ser Gin Gin Ser Leu Val Asp Pro He Val Gin Leu Lys Gly Leu 

210 215 220 
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ACG TTC ACG GCC ACG GGC ATC CTG ACA GTG GTG ACA CAG CTA TTT CAT 719 
Thr Phe Thr Ala Thr Gly lie Leu Thr Val Val Thr Gin Leu Phe His 
225 230 235 

CAT AAG AAT GGG GCC CGA AAA AGT GCC AAG AAG ATC CTC ATT GTC ATC 767 
His Lys Asn Gly Ala Arg Lye Ser Ala Lya Lys lie Leu lie Val lie 
240 245 250 255 

ACA GAT GGG CAG AAG TAC AAA GAC CCC CTG GAA TAG AGT GAT GTC ATC 815 
Thr Asp Gly Gin Lys Tyr Lys Asp Pro Leu Glu Tyr Ser Asp Val lie 
260 265 270 

CCC CAG GCA GAG AAG GCT GGC ATC ATC CGC TAC GCT ATC GGG GTG GGA 863 
Pro Gin Ala Glu Lys Ala Gly lie lie Arg Tyr Ala lie Gly Val Gly 
275 280 285 

CAC GCT TTC CAG GCA CCC ACT GCC AGG CAG GAG CTG AAT ACC ATC AGC 911 
His Ala Phe Gin Gly Pro Thr Ala Arg Gin Glu Leu Asn Thr lie Ser 
290 295 300 

TCA GCG CCT CCG CAG GAC CAC GTG TTC AAG GTG GAC AAC TTT GCA GCC 959 
Ser Ala Pro Pro Cln Asp His Val Phe Lys Val Asp Aen Phe Ala Ala 
305 310 315 

CTT GGC AGC ATC CAG AAG CAG CTG CAG GAG AAG ATC TAT GCA GTT GAG 1007 
Leu Gly Ser He Gin Lys Gin Leu Gin Glu Lys lie Tyr Ala Val Glu 
320 325 330 335 

GGA ACC CAC TCC AGG GCA AGC AGC TCC TTC CAG CAC GAG ATG TCC CAA 1055 
Gly Thr Gin Ser Arg Ala Ser Ser Ser Phe Gin His Glu Met Ser Gin 
340 345 350 

GAA GGC TTC AGC ACA GCC CTC ACA ATG GAT GGC CTC TTC CTG GGG GCT 1103 
Glu Gly Phe Ser Thr Ala Leu Thr Met Asp Gly Leu Phe Leu Gly Ala 
355 360 365 

GTG GGG AGC TTT AGC TGG TCT GGA GGT GCC TTC CTG TAT CCC CCA AAT 1151 
Val Gly Ser Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro Asn 
370 $75 380 

ATG AGC CCC ACC TTC ATC AAC ATG TCT CAG GAG AAT GTG GAC ATG AGG 1199 
Met Ser Pro Thr Phe He Asn Met Ser Gin Glu Asn Val Asp Met Arg 
385 390 < 395 

GAC TCT TAC CTC GGT TAC TCC ACC GAG CTA GCC CTG TGG AAG GGG GTA 1247 
Asp Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly Val 
400 405 410 415 

CAG AAC CTG GTC CTG GGG GCC CCC CGC TAC CAG CAT ACC GGG AAG GCT 1295 
Gin Asn Leu Val Leu Gly Ala Pro Arg Tyr Gin His Thr Gly Lys Ala 
420 425 430 

GTC ATC TTC ACC CAG GTG TCC AGG CAA TGG AGG AAG AAG GCC GAA GTC 1343 
Val He Phe Thr Gin Val Ser Arg Gin Trp Arg Lys Lys Ala Glu Val 
435 440 445 

ACA GGG ACG CAG ATC GGC TCC TAC TTC GGG GCC TCC CTC TGC TCC GTG 1391 
Thr Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val 
450 455 460 ' 
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GAT GTG GAC AGC GAT GGC AGC ACC GAC CTG ATC CTC ATT GGG GCC CCC 1439 
Asp Val Asp Ser Asp Gly Ser Thr Asp Leu lie Leu lie Gly Ala Pro 
465 470 475 

CAT TAC TAT GAG CAG ACC CGA GGG GGC CAG GTG TCC GTG TGT CCC TTG 1487 
His Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro Leu 
480 485 490 495 

CCT AGG GGG CAG AGO GTG CAG TGG CAG TGT GAC GCT GTT CTC CGT GGT 1535 
Pro Arg Gly Gin Arg Val Gin Trp Gin Cys Asp Ala Val Leu Arg Gly 
500 505 510 

GAG CAG GGC CAC CCC TGG GGC CGC TTT GGG GCA GCC CTG ACA GTG TTG 1583 
Glu Gin Gly His Pro Trp Gly Arg Phe Gly A1& Ala Leu Thr Val Leu 
515 520 525 

GGG GAT GTG AAT GAG GAC AAG CTG ATA GAC GTG GCC ATT GGG GCC CCG 1631 
Gly Asp Val Asn Glu Asp Lys Leu lie Asp Val Ala lie Gly Ala Pro 
530 535 540 

GGA GAG CAG GAG AAC CGG GGT GCT GTC. TAC CTG TTT CAC GGA GCC TCA 1679 
: Gly Glu Gin Glu Asn Arg Gly Ala Val, Tyr Leu Phe His Gly Ala Ser 
545 550 555 

GAA TCC GGC ATC AGC CCC TCC CAC AGC CAG CGG ATT GCC AGC TCC CAG 1727 
Glu Ser Gly lie Ser Pro Ser. Hie Ser Gin Arg lie Ala Ser Ser Gin 
560 565 570 575 

CTC TCC CCC AGG CTG CAG TAT TTT GGG CAG GCG CTG AGT GGG GGT CAG 1775 
Leu Ser Pro Arg Leu Gin Tyr Phe Gly Gin Ala Leu Ser Gly Gly Gin 
580 585 590 

GAC CTC ACC CAG GAT GGA CTG .ATG GAC CTG GCC GTG GGG GCC CGG GGC 1823 
Asp Leu Thr, Gin Asp Gly Leu Met Asp Leu Ala Val Gly Ala Arg Gly 
595 600 605 

CAG GTG;, CTC CTG CTC, AGG AGT CTG CO? GTG CTG. AAA GTG GGG GTG GCC 1871 
Gin Val Leu Leu Leu Arg Ser Leu Pro Val Leu Lys Val Gly Val Ala 
610 615 620 

, ATG AGA TTC AGC CCT GTG GAG GTG GCC AAG GCT GTG TAC CGG TGC TGG 1919 
Met Arg Phe Ser Pro Val Glu Val Ala Lys Ala Val Tyr Arg Cys Trp 
625 630 ~ ■ 635 

GAA GAG AAG CCC AGT GCC CTG GAA . GCT GGG GAC GCC ACC GTC TGT CTC 1967 
, Glu Glu Lye Pro Ser Ala Leu- Glu Ala Gly Asp Ala Thr Val Cys Leu 
640 645 650 655 

ACC AITC CAG AAA AGC TCA CTG GAC CAG CTA GGT GAC ATC CAA AGC TCT 2015 
Thr lie Gin Lys Ser . Ser Leu; Asp Gin Leu Gly Asp lie Gin Ser Ser 
660 665 . 670 

GTC AGG TTT GAT CTG GCA CTG GAC CCA GGT CGT CTG ACT TCT CGT GCC 2063 
Val Arg Phe Asp Leu Ala Leu Asp Pro Gly Arg Leu Thr Ser Arg Ala 
675 680 665 

ATT TTC AAT GAA ACC AAG AAC CCC . ACT TTG ACT CGA AGA AAA ACC CTG 2111 
Ile ; Phe Ann Glu Thr Lys. Asn Pro Thr Leu Thr Arg Arg Lys Thr Leu 
690 695 700 
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GGA CTG GGG ATT CAC TGT GAA ACC CTG AAG CTG CTT TTG CCA GAT TGT 2159 
Gly Leu Gly lie His Cys Glu Thr Leu Lys Leu Leu Leu Pro Asp Cys 
705 710 715 

GTG GAG GAT GTG GTG AGC CCC ATC ATT CTG CAC CTC AAC TTC TCA CTG 2207 
Val Glu Asp Val Val Ser Pro lie lie Leu His Leu Ash Phe Ser Leu 
720 725 730 735 

GTG AGA GAG CCC ATC CCC TCC CCC CAG AAC CTG CGT CCT GTG CTG GCC 2255 
Val Arg Glu Pro lis Pro Ser Pro Gin Asn Leu Arg Pro Val Leu Ala 
740 745 750 

GTG GGC TCA CAA GAC CTC TTC ACT GCT TCT CTC CCC TTC GAG AAG AAC 2303 
Val Gly Ser Gin Asp Leu Phe Thr Ala Ser Leu Pro Phe Glu Lys Asn 
755 760 765 

TGT: GGG CAA GAT GGC CTC TGT GAA GGG GAC CTG GGT CTC ACC CTC AGC 2351 
Cys Gly Gin Asp Gly Leu Cys Glu Gly Asp Leu Gly Val Thr Leu Ser 
770 775 780 

TTC TCA, GGC CTG CAG ACC CTG ACC GTG GGG AGC TCC CTG GAG CTC AAC 2399 
Phe Ser Gly Leu Gin Thr Leu Thr Val Gly Ser Ser Leu Glu Leu Asn 
785 790 795 

GTG ATT GTG ACT GTG TGG AAC GCA GGT GAG GAT TCC TAC GGA ACC GTG 2447 
Val He Val Thr Val Trp Asn Ala Gly Glu Asp Ser Tyr Gly Thr val 
800 * 805 810 815 

GTC AGC CTC TAC TAT - CCA GCA GGG CTG TCG CAC CGA CGG GTG TCA GGA 2495 
Val Ser Leu Tyr Tyr Pro Ala Gly Leu Ser His Arg Arg Val Ser Gly 
820 825 830 

GCC CAG AAG CAG CCC CAT CAG ACT GCC CTG CGC CTG GCA TGT GAG ACA 2543 
Ala Gin Lys Gin Pro His Gin Ser Ala Leu Arg Leu Ala Cys Glu' Thr 
835 840 845 

GTG CCC ACT GAG GAT GAG GGC CTA AGA AGC AGC CGC TGC AGT GTC AAC 2591 
Val Pro Thr Glu Asp Glu Gly Leu Arg Ser Ser Arg Cye Ser Val Asn 
850 855 860 

CAC CCC ATC TTC CAT GAG GGC TCT AAC GGC ACC TTC ATA GTC ACA TTC 2639 
His Pro He Phe His Glu Gly Ser A3n Gly Thr Phe lie val Thr Phe 
865 870 875 

GAT GTC TCC TAC AAG GCC ACC CTG GGA GAC AGG ATG CTT ATG AGG GCC 2687 
Asp Val Ser Tyr Lys Ala Thr Leu Gly Asp Arg Met Leu Met Arg Ala 
880 885 890 895 

AGT GCA AGC AGT GAG AAC AAT AAG GCT TCA AGC AGC AAG GCC ACC TTC 2735 
Ser Ala Ser Ser Glu Asn Asn Lys Ala Ser Ser Ser Lys Ala Thr Phe 
900 905 910 

CAG CTG GAG CTC CCG GTG AAG TAT GCA GTC TAC ACC ATG ATC AGC AGG 2783 
Gin Leu Glu Leu Pro Val Lys~ Tyr Ala Val Tyr Thr Met lie Ser Arg 
915 920 925 

CAG GAA GAA TCC ACC AAG TAC TTC AAC- TTT GCA ACC TCC GAT GAG AAG 2831 
Gin Glu Glu Ser Thr Lys Tyr-. Phe Asn Phe Ala Thr Ser Asp Glu Lys 
930 935 940 
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AAA 
Lys 


ATG 
Met 
945 


AAA 
Lys 


GAG 
Glu 


GCT 
Ala 


GAG 
Glu 


CAT 
His 
950 


CGA 
Arg 


TAC 
Tyr 


CGT 
Arg 


GTG 
Val 


AAT 
Asn 
955 


AAC 
Asn 


CTC 
Leu 


AGC 
Ser 


CAG 
Gin 


2879 


CGA 
Arg 
960 


GAT 
Asp 


CTG 
Leu 


GCC 
Ala 


ATC 
He 


AGC 
Ser 
965 


ATT 

He 


AAC 
Asn 


TTC 
Phe 


TGG GTT 
Trp Val 
970 


CCT 
Pro 


GTC 
Val 


CTG 
Leu 


CTG 
Leu 


AAC 
Asn 
975 


2927 


GGG 
Gly 


GTG 
Val 


GCT 
Ala 


GTG 
Val 


TGG GAT GTG GTC 
Trp Asp Val Vel 
980 


ATG 
Met 


GAG 
Glu 
985 


GCC 
Ala 


CCA 
Pro 


TCT 
Ser 


CAG 
Gin 


AGT 
Ser 
990 


CTC 
Leu 


2975 


ccc 
Pro 


TGT 
Cys 


GTT 
Val 


TCA GAG AGA AAA 
Ser Glu Arg Lys 
995 


CCT 
Pro 


CCC CAG 
Pro Gin 
1000 


CAT 
His 


TCT 
Ser 


GAC 
Asp 


TTC CTG 
Phe Leu 
1005 


ACC 
Thr 


3023 


CAG 
Gin 


ATT 
He 


TCA ACA ACT CCC ATG 
Ser Arg Ser Pro Met 
1010 


CTG GAC 
Leu Asp 
1015 


TGC 
Cys 


TCC 

Ser 


ATT 
He 


GCT GAC 
Ala Asp 
1020 


TGC 
Cys 


CTG 
Leu 


3071 


CAG 
Gin 


TTC CCC 
Phe Arg 
1025 


TGT GAC CTC 
Cys Asp Val 


CCC TCC 
Pro Ser 
103 C 


TTC 
Phe 


AGC 
Ser 


GTC 
Val 


CAG GAG 
Gin Glu 
1035 


GAG 
Glu 


CTG 
Leu 


GAT 
Asp 


3119 


TTC ACC 
Phe Thr 
1040 


CTG 
Leu 


AAG GGC AAT CTC 
Lys Gly Asn Leu 
1045 


AGT 

Ser 


TTC 

Phe 


GGC TGG GTC 
Gly Trp Val 
1050 


CGC 
Arg 


GAG 
Glu 


ACA 
Thr 


TTG 
Leu 
1055 


3167 


CAG 
Gin 


AAG 
Lys 


AAG 
Lys 


GTG 
Val 


TTC GTC 
Leu Val 
1060 


GTC 
Val 


AGT 
Ser 


GTG 
Val 


GCT GAA 
Ala Glu 
1065 


ATT 
He 


ACG 
Thr 


TTC 
Phe 


GAC ACA 
Asp Thr 
1070 


3215 


TCC 
Ser 


GTG 
Val 


TAC 
Tyr 


TCC CAG 

Ser Gin 
1075 


CTT 
Leu 


CCA 
Pro 


GGA 
Gly 


CAG GAG 
Gin Glu 
1080 


GCA 
Ala 


TTT 
Phe 


ATG 
Met 


AGA GCT 
Arg Ala 
1085 


CAG 
Gin 


3263 


ATG 
Met 


GAG 
Glu 


ATG GTG 
Met Val 
1090 


CTA 
Leu 


GAA 

Glu 


GAA 
Glu 


GAC GAG 
Asp Glu 
1095 


GTC 

Val 


TAC 

Tyr 


AAT 

Asn 


GCC ATT 
Ala He 
1100 


CCC 
Pro 


ATC 
He 


3311 


ATC 
He 


ATG GGC 
Met Gly 
1105 


AGC 
Ser 


TCT 
Ser 


GTG 
Val 


GGG GCT 
Gly Ala 
1110 


CTG 
Leu 


CTA 
Leu 


CTG 
Leu 


CTG GCG 
Leu Ala 
1115 


CTC 
Leu 


ATC 
He 


ACA 
Thr 


3359 


GCC 
Ala 
1120 


ACA 
Thr 

i.. 


CTG 
Leu 


TAC 
Tyr 


AAG 
Lys 


CTT GGC 
Leu Gly 
1125 


TTC 
Phe 


TTC 
Phe 


AAA 
Lys 


CGC CAC 
Arg His 
1130 


TAC 
Tyr 


AAG 

Lys 


GAA 
Glu 


ATG 
Met 
1135 


3407 


CTG 
Leu 


GAG GAC 
Glu Asp 


AAG 
Lys 


CCT GAA GAC 
Pro Glu Asp 
1140 


ACT 
Thr 


GCC 
Ala 


ACA 
Thr 
1145 


TTC 
Phe 


AGT GGG 
Ser Gly 


GAC 

Asp 


GAT 
Asp 
1150 


TTC 
Phe 

► 


3455 


AGC 
Ser 


TGT GTG 
Cys Val 


GCC 
Ala 


CCA 
Pro 


AAT 
Asn 


GTG 
Val 


CCT 
Pro 


TTG 
Leu 


TCC 
Ser 


TAATAATCCA CTTTCCTGTT 


3505 



1155 1160 

TATCTCTACC ACTGTGGGCT GGACTTGCTT GCAACCATAA ATCAACTTAC ATGGAAACAA 3566 

CTTCTGCATA GATCTGCACT GGCCTAAGCA ACCTACCAGG TGCTAAGCAC CTTCTCGGAG 3625 

AGATAGAGAT TGTAATGTTT TTACATATCT GTCCATCTTT TTCAGCAATG ACCCACTTTT 3685 
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TACAGAAGCA GGCATGGTGC CAGCATAAAT TTTCATATGC T 3726 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1161 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

. Thr Phe Gly Thr Val Leu Leu Leu Ser Val Leu Ala Ser Tyr His Gly 
15 10 15 

Phe Asn Leu Asp Val Glu Gl.u Pro Thr lie Phe Gin Glu Aep Ala Gly 
- 20 25 30 

Gly Phe Gly Gin Ser Val Val Gin Phe Gly Gly Ser Arg Leu Val Val 

35 40 • 45 

Gly Ala Pro Leu Glu Val Val Ala Ala Asn Gin Thr Gly Arg Leu Tyr 
50 55 60 

Asp CyB Ala Ala Ala Thr Gly Met Cys Gin Pro lie Fro Leu His lie 
65 70 75 80 

Arg Pro Glu Ala Val Asn Met Ser Leu Gly Leu .Thr Leu Ala Ala Ser 
,85 90 95 

Thr Asn Gly Ser Arg Leu Leu Ala Cys Gly Pro Thr Leu His Arg Val 
100 105 110 

Cye Gly Glu Asn Ser Tyr; Ser Lys Gly Ser Cys Leu Leu Leu Gly Ser 
115 120 125 

Arg Trp Glu lie lie Gin Thr Val Pro Asp Ala Thr Pro Glu Cys Pro 
13C . 135 140 

His Gin Glu Met Asp lie Val Phe Leu He Asp Gly Ser Gly "Ser He 
145 ISO 155 160 

Asp Gin Asn Asp Phe Asn Gin Met Lys Gly Phe Val Gin Ala Val Met 

165 170 175 

Gly Gin Phe Glu Gly Thr Asp Thr Leu: Phe Ala Leu Met Gin Tyr Ser 
180 185 190 

Asn Leu Leu Lys lie His Phe Thr Phe Thr Gin Phe Arg Thr Ser Pro 
195 200 205 

Ser Gin Gin Ser Leu val Asp Pro lie val Gin Leu Ly3 Gly Leu Thr 
210 215 220 

Phe Thr Ala Thr Gly He Leu Thr Val Val Thr Gin Leu Phe His His 
225 230 235 240 

Lys Asn Gly Ala Arg Lys. Ser Ala Lys Lys lie Leu He Val lie Thr 
245 250 255 
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Asp Gly Gin Lye Tyr Lys Asp Pre Leu Glu Tyr Ser Asp Val lie Pro 
260 265 270 

Gin Ala Glu Lys Ala Gly lie lie Arg Tyr Ala lie Gly Val Gly His 
275 280 285 

Ala Phe Gin Gly Pro Thr Ala Arg Gin Glu Leu Asn Thr lie Ser Ser 
290 295 300 

Ala Pro Pro Gin Asp His Val Phe Lys Val Asp Asn Phe Ala Ala Leu 
305 310 315 320 

Gly Ser He Gin Lys Gin Leu Gin Glu Lys He Tyr Ala Val Glu Gly 
325 330 335 

Thr Gin Ser Arg Ala Ser Ser Ser Phe Gin Hie Glu Met Ser Gin Glu 
340 345 350 

Gly Phe Ser Thr Ala Leu Thr Met Asp Gly Leu Phe Leu. Gly, Ala Val 
355 3S0 365 

Gly Ser Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro Asn Met 
370 375 380 

Ser Pro Thr Phe He Asn Met Ser Gin Glu Asn Val Asp Met Arg Asp 
385 390 395 400 

Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly Val Gin 
405 410 415 

Aen.Leu Val Leu Gly Ala Pro Arg Tyr Gin His Thr Gly Lys Ala Val 
420 425 430 

lie Phe Thr Gin Val Ser Arg Gin Trp Arg Lys Lys Ala Glu Val Thr 
435 440 445 

Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val Asp 
450 455 460 

Val Asp Ser Asp Gly Ser Thr Asp Leu He Leu lie Gly Ala Pro His 
465 470 475 480 

Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro Leu Pro 
485 490 495 

Arg Gly Gin Arg Val Gin Trp Gin Cys Asp Ala Val Leu Arg Gly Glu 
500 505 510 

Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu Gly 
515 520 525 

Asp Val Asn Glu Asp Lys Leu lie Asp Val Ala He Gly Ala Pro Gly 
530 535 540 

Glu Gin Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Ala Ser Glu 
545 550 555 560 

Ser Gly He Ser Pro Ser His Ser Gin Arg He Ala Ser Sar Gin Leu 
565 570 575 
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Ser Pro Arg Leu Gin Tyr Phe Gly Gin Ala Leu Ser Gly Gly Gin Asp 
580 585 590 

Leu Thr Gin Aap Gly Leu Met Asp Leu Ala Val Gly Ala Arg Cly Gin 
595 600 605 

Val Leu Leu Leu Arg Ser Leu Pro Val Leu Lye Val Gly Val Ala Met 
610 615 620 

Arg Phe Ser Pro Val Glu val Ala Lys Ala Val Tyr Arg Cys Trp Glu 
625 630 635 640 

Glu Lys Pro Ser Ala Leu Glu Ala Gly Asp Ala Thr Val Cys Leu Thr 
645 650 655 

lie Gin Lys Ser Ser. Leu Asp Gin Leu Sly Asp lie Gin Ser Ser Val 
660 665 670 

Arg Phe Asp Leu Ala Leu Asp Pro Gly Arg Leu Thr Ser Arg Ala lie 
675 680 685 

Phe Asn Glu Thr Lys Asm Pro Thr Leu Thr Arg Arg Lys Thr Leu Gly 
690 695 700 

Leu Gly lie His Cys Glu Thr Leu Lys Leu Leu Leu Pro Asp Cys Val 
705 710 715 720 

Glu Asp Val Val Ser. Pro lie .He Leu His Leu Asn Phe Ser Leu Val 
725 .. 730 735 

Arg Glu Pro He.. Pro Ser Pro: Gin Asn Leu Arg Pro Val Leu Ala Val 
740 745 750 

Gly Ser Gin Asp Leu Phe Thr Ala Ser Leu Pro Phe Glu Lys Asn Cys 
755 .760 765 

Gly Gin Asp Gly Leu ; Cys Glu . Gly Asp Leu Gly Val Thr Leu Ser Phe 
770 775 780 

Ser Gly Leu Gin Thr Leu Thr Val Gly Ser Ser Leu Glu Leu Asn Val 
785 790 795 800 

He Val Thr Val Trp Asn Ala Gly Glu Asp Ser Tyr Gly Thr Val Val 



Ser Leu Tyr Tyr Pro Ala Gly Leu Ser His Arg Arg Val Ser Gly Ala 
820 825 830 

Gin Lys Gin Pro His Gin Ser Ala Leu Arg Leu Ala Cys Glu Thr Val 
835 840 845 

Pro Thr Glu Asp Glu Gly Leu Arg Ser Ser Arg Cys Ser Val Asn His 

650 855 860 

Pro lie Phe His Glu Gly Ser Asn Gly Thr Phe He Val Thr Phe Asp 

865 870 875 880 

Val Ser Tyr Lye Ala Thr Leu Gly Asp Arg Met Leu Met Arg Ala Ser 



805 



810 



815 



885 



890 



895 
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Ala Ser Ser Glu Asn Asn Lye Ala Ser Ser Ser Lye Ala Thr Phe Gin 
900 905 910 

Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Thr Met lie Ser Arg Gin 
915 920 925 

Glu Glu Ser Thr Lys Tyr Phe Asn Phe Ala Thr Ser Asp Glu Lys Lys 
930 935 940 

Met Lys Glu Ala Glu His Arg Tyr Arg Val Asn Asn Leu ser Gin Arg 
945 950 955 960 

Asp Leu Ala lie Ser He Asn Phe Trp Val Pro Val Leu Leu Asn Gly 
965 970 975 

Val Ala Val Trp Asp Val Val Met Glu Ala Pro Ser Gin Ser Leu Pro 
980 985 990 

Cys Val Ser Glu Arg Lys Pro Pro Gin His Ser Asp Phe Leu Thr Gin 
995 1000 1005 

He Ser Arg Ser Pro Met Leu Asp Cys Ser He Ala Asp Cys Leu Gin 
1010 1015 1020 

Phe Arg Cys Asp Val Pro Ser Phe Ser Val Gin Glu Glu Leu Asp Phe 
1025 1030 ' ' 1035 1Q40 

Thr Leu Lys Gly Asn Leu Ser Phe Gly Trp Val Arg Glu Thr Leu Gin 
1045 1050 1055 

Lys Lys Val Leu Val Val Ser Val Ala Glu He Thr Phe Asp Thr Ser 
1060 1065 1070 

Val Tyr Ser Gin Leu Pro Gly Gin Glu . Ala Phe Met Arg Ala Gin Met 
1075 1080 1085 

Glu Met Val Leu Glu Glu Asp Glu Val Tyr Asn Ala He Pro He He 
1090 1095 1100 

Met Gly Ser Ser Val Gly Ala Leu Leu Leu Leu Ala Leu He Thr Ala 
1105 1110 ! 1115 H20 

Thr Leu Tyr Lys Leu Gly Phe Phe Lyo Arg His Tyr Lys Glu Met Leu 
1125 1130 1135 

Glu Asp Lys Pro Glu Asp Thr Ala Thr Phe Ser Gly Asp Asp Phe Ser 
1140 1145 1150 

Cys Val Ala Pro Asn Val Pro Lys Ser 
1155 1160 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
;* fD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Leu Arg Val Leu Leu Leu Thr Ala Leu Thr Leu Cys His Gly 
15 10 15 

Phe Asn Leu Asp Thr Glu Asn Ala Met Thr Phe Gin Glu Asn Ala Arg 
20 25 30 

Gly Phe Gly Gin Ser Val Val Gin Leu Gin Gly Ser Arg Val Val Val 
35 40 45 

Gly Ala Pro Gin Glu lie Val Ala Ala Asn Gin Arg Gly Ser Leu Tyr 
50 55 60 

Gin Cys Asp Tyr Ser Thr Gly Ser Cys Glu Pro He Arg Leu Gin Val 
65 70 75 80 

Pro Val Glu Ala Val Asn' Met Ser Leu Gly Leu Ser Leu Ala Ala Thr 
85 90 95 

Thr Ser Pro Pro Gin Leu Leu Ala Cys Gly Pro Thr Val His Gin Thr 
100 105 110 

Cys Ser Glu Asn Thr Tyr Val Lys Gly Leu Cys Phe Leu Phe Gly Ser 
11S 120 125 

Asn Leu Arg Gin Gin Pro Gin Lys Phe Pro Glu Ala Leu Arg Gly Cys 
130 135 140 

Pro Gin Glu Asp Ser Asp lie Ala Phe Leu lie Asp Gly Ser Gly Ser 
145 150 155 160 

He He Pro His Asp Phe Arg Arg Met Lys Glu Phe Val Ser Thr Val 
165 170 175 

Met Glu Gin Leu Lys Lys Ser Lys Thr Leu Phe Ser Leu Met Gin Tyr 
180 ' 185 190 

Ser Glu Glii Phe Arg He His Phe Thr Phe Lys Glu Phe Gin Asn Asn 
195 200 205 

Pro Asn Pro Arg Ser Leu Val Lys Pro He Thr Gin Leu Leu Gly Arg 
210 215 220 

Thr His Thr Ala Thr Gly He Arg Lys Val Val Arg Glu Leu Phe Asn 
225 230 235 240 

He Thr Asn Gly Ala Arg Lys Asn Ala Phe Lys He Leu Val Val He 
245 250 255 

Thr Asp Gly Glu Lys Phe Gly Asp Pro Leu Gly Tyr Glu Asp Val He 
260 265 270 

Pro Glu Ala Asp Arg Glu Gly Val lie Arg Tyr Val He Gly Val Gly 

275 \ 280 , . t 285 

Asp Ala Phe Arg Ser Glu Lys Ser Arg Gin Glu Leu Asn Thr He Ala 
290 295 300 
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Ser Lys Pro Pro Arg Asp His Val Phe Gin Val Asn Asn Phe Glu Ala 
305 310 315 320 

Leu Lys Thr He Gin Asn Gin Leu Arg Glu Lys He Phe Ala He Glu 
325 330 335 

Gly Thr Gin Thr Gly Ser Ser Ser Ser Phe Glu His Glu Met Ser Gin 
340 345 350 

Glu Gly Phe Ser Ala Ala He Thr Ser Asn Gly Pro Leu Leu Ser Thr 
355 360 365 

Val Gly Ser Tyr Asp Trp Ala Gly Gly Val Phe Leu Tyr Thr Ser Lys 
370 375 380 

Glu Lys Ser Thr Phe He Asn Met Thr Arg Val Asp Ser Asp Met Asn 
385 390 395 400 

Asp Ala Tyr Leu Gly Tyr Ala Ala Ala He He Leu Arg Asn Arg Val 
405 410 415 

Gin Ser Leu Val Leu Gly Ala Pro Arg Tyr Gin His He Gly Leu Val 
420 " 425 430 

Ala Met Phe Arg Gin Asn Thr Gly Met Trp Glu Ser Asn Ala Asn Val 
435 ' - 44C 445 

Lys Gly Thr Gin He Gly Ala Tyr Phe Gly Ala Ser Leu Cys Ser Val 
450 455 460 

Asp Val Asp Ser Asn Gly Ser Thr Asp Leu Val Leu He Gly Ala Pro 
465 470 475 480 

His Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro Leu 
485 490 495 

Pro Arg Gly Gin Arg Ala Arg Trp Gin Cys Asp Ala Val Leu Tyr Gly 
500 505 510 

Glu Gin Gly Gin Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
515 S20 525 

Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Ala He Gly Ala Pro 
530 535 540 

Gly Glu Glu Asp Asn Arg Gly Ala Val Tyr Leu Phe His Gly Thr Ser 
54S 5S0 * 555 560 

Gly Ser Gly He Ser Pro Ser His Ser Gin Arg He Ala Gly Ser Lys 
565 . 570 575 

Leu Ser Pro Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly Gin 
" 580 ' 585 590 

Asp Leu Thr Met Asp Gly Leu Val Asp Leu Thr Val Gly Ala Gin Gly 
595 - 600 605 

His Val Leu Leu Leu Arg Ser Gin Pro Val Leu Arg Val Lys Ala He 
610 615 620 
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Met Glu Phe Asn Pro Arg Glu Val Ala Arg Asn Val Phe Glu Cys Asn 
625 630 635 640 

Asp Gin Val Val Lys Gly Lys Glu Ala Gly Glu Val Arg Val Cys Leu 
645 650 655 

Hie Val Gin Lys Ser Thr Arg Asp Arg Leu Arg Glu Gly Gin lie Gin 
660 665 670 

Ser Val Val Thr Tyr Asp Leu Ala Leu Asp Ser Gly Arg Pro His Ser 
675 680 685 

Arg Ala Val Phe Asn Glu Thr Lys Asn Ser Thr Arg Arg Gin Thr Gin 
690 695 700 

Val Leu Gly Leu Thr Gin Thr Cys Glu Thr Leu Lys Leu Gin Leu Pro 
705 710 715 720 

Asn Cys He Clu Aop Pro Val Ser Pro He Val Leu Arg Leu Asn Phe 
725 730 735 

Ser Leu Val Gly Thr Pro Leu Ser Ala Phe Gly Asn Leu Arg Pro Val 
740 745 750 

Leu Ala Clu Asp Ala. Gin Arg Leu Phe Thr Ala Leu Phe Pro Phe Glu 
755 760 765 

Lys Asn Cys Gly Asn Asp Asn He Cys Gin Asp Asp Leu Ser He Thr 
770 775 780 

Phe Ser Phe Met Ser Leu Asp Cys Leu Val Val Gly Gly Pro Arg Glu 
785 790 795 800 

Phe Asn Val Thr Val Thr Val Arg Asn Asp Gly Glu Asp Ser Tyr Arg 
805 810 815 

Thr Gin Val Thr Phe Phe Phe Pro Leu Asp Leu Ser Tyr Arg Lys Val 
820 825 830 

Ser Thr Leu Gin Asn Gin Arg Ser Gin Arg Ser Trp Arg Leu Ala Cys 
835 840 845 

Glu Ser Ala Ser Ser Thr Glu Val Ser Gly Ala Leu Lys Ser Thr Ser 
850 855 860 

Cys Ser He Asn His Pro lie Phe Pro Glu Asn Ser Glu Val Thr Phe 
865 870 875 880 

Asn He Thr Phe Asp Val Asp Ser Lys Ala Ser Leu Gly Asn Lys Leu 
865 890 895 

Leu Leu Lys Ala- Asn Val Thr Ser Glu Asn Asn Met Pro Arg Thr Asn 
900 905 910 

Lys Thr Glu Phe Gin Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Met 
915 920 925 

Val Val Thr Ser His Gly Val Ser Thr Lys Tyr Leu Asn Phe Thr Ala 
930 935 940 
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Ser Glu Asn Thr Ser Arg Val Met Gin His Gin Tyr Gin Val Ser Asn 
945 950 955 960 

Leu Gly Gin Arg Ser Leu Pro lie Ser Leu Val Phe Leu Val Pro Val 
965 970 975 

Arg Leu Asn Gin Thr Val lie Trp Asp Arg Pro Gin Val Thr Phe Ser 
980 985 990 

Glu Asn Leu Ser Ser Thr Cys His Thr Lys Glu Arg Leu Pro Ser His 
995 1000 1005 

Ser Asp Phe Leu Ala Glu Leu Arg Lys Ala Pro Val Val Asn Cys Ser 
1010 1015 1020 

lie Ala Val Cys Gin Arg lie Gin Cys Asp lie Pro Phe Phe Gly He 
1025 1030 1035 1040 

Gin Glu Glu Phe Asn Ala Thr Leu Lys Gly Asn Leu Ser Phe Asp Trp 
1045 1050 1055 

Tyr He Lys Thr Ser His Asn His Leu Leu He Val Ser Thr Ala Glu 
1060 1065 1070 

He Leu Phe Asn Asp Ser Val Phe Thr Leu Leu Pro Gly Gin Gly Ala 
1075 1080 1085 

Phe Val Arg Ser Gin Thr Glu Thr Lys Val Glu Pro Phe Glu Val Pro 
1090 1095 1100 

Asn Pro Leu Pro Leu He Val Gly Ser Ser Val Gly Gly Leu Leu Leu 
1105 1110 1115 1120 

Leu Ala Leu lie Thr Ala Ala Leu Tyr Lys Leu Gly Phe Phe Lys Arg 
1125 1130 1135 

Gin Tyr Lys Asp Met Met Ser Glu Gly Gly Pro Pro Gly Ala Glu Pro 
1140 1145 1150 

Gin 



(2) INFORMATION FOR SEQ ID NO: 4; 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 1163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE5S: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Arg Thr Arg Ala Ala Leu Leu Leu Phe Thr Ala Leu Ala Thr 

1 ,5 .10 15 

Ser Leu Gly Phe Asn Leu Asp Thr Glu Glu Leu Thr Ala Phe Arg Val 
20 25 30 
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Asp Ser Ala Gly Phe Gly Asp Ser Val Val Gin Tyr Ala Asn Ser Trp 
35 40 45 

Val Val Val Gly Ala Pro Gin Lys lie lie Ala Ala Asn Gin lie Gly 
50 55 60 

Gly Leu Tyr Gin Cys Gly Tyr Ser Thr Gly Ala Cys Glu Pro lie Gly 
65 70 75 80 

Leu Gin Val Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

Ala Ser Thr Thr Ser Pro Ser Gin Leu Leu Ala Cys Gly Pro Thr Val 
100 105 110 

His His Glu Cys Gly Arg Asn Met Tyr Leu Thr Gly Leu Cys Phe Leu 
115 120 125 

Leu Gly Pro Thr Gin Leu Thr Gin Arg Leu Pro Val Ser Arg Gin Glu 
130 135 140 

Cys Pro Arg Gin Glu Gin Asp lie Val Phe Leu lie Asp Gly Ser Gly 
145 150 155 160 

Ser lie Ser Ser Arg Asn Phe Ala Thr Met Met Asn Phe Val Arg Ala 
165 170 175 

Val lie Ser Gin Phe Gin Arg Pro Ser Thr Gin Phe Ser Leu Met Gin 
180 185 190 

Phe Ser Asn Lys Phe Gin Thr His Phe Thr Phe Glu Glu Phe Arg Arg 
195 200 205 

Thr Ser Asn Pro Leu Ser Leu Leu Ala Ser Val His Gin Leu Gin Gly 
210 215 220 

Phe Thr Tyr Thr Ala Thr Ala lie Gin Asn Val Val His Arg Leu Phe 
225 230 235 240 

His Ala Ser Tyr Gly Ala Arg Arg Asp Ala lie Lys lie Leu lie Val 
245 250 255 

lie Thr Asp Gly Lys Lys Glu Gly Asp Ser Leu Asp Tyr Lys Asp Val 
260 265 * ' 270 

lie Pro Met Ala Asp Ala Ala Gly lie lie Arg Tyr Ala lie Gly Val 
275 280 285 

Gly Leu Ala Phe Gin Asn Arg Asn Ser Trp Lys Glu Leu Asn Asp lie 
290 295 300 

Ala Ser Lys Pro Ser Gin Glu His He Phe Lys Val Glu Asp Phe Asp 
305 310 315 320 

Ala Leu Lys Asp lie Gin Asn Gin Leu Lys Glu Lys lie Phe Ala lie 
325 ' 330 335 



Glu Gly Thr Glu Thr lie Ser Ser Ser Ser Phe Glu' Leu Glu Met Ala 
340 345 350 
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Gin Glu Gly Phe Ser Ala Val Phe Thr Pro Asp Gly Pro Val Leu Gly 
355 360 365 

Ala Val Gly Ser Phe Thr Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro 
370 375 380 

Asn Met Ser Pro Thr Phe lie Asn Met Ser Gin Glu Asn Val Asp Met 
385 390 395 400 

Arg Asp Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly 
405 410 415 

Val Gin Ser Leu Val Leu Gly. Ala Pro Arg Tyr Gin His lie Gly Lys 
420 425 430 

Ala Val XI* Pha lie Gin Val Ser Arg Gin Trp Arg Met Lys Ala Glu 
435 440 445 

Val Il» C.ty Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser 
450 455 460 

Val Aap Val hup Thr Asp Gly Ser Thr Asp. Leu Val Leu He Gly Ala 
465 470 475 480 

Pro His Tyr Tyr Clu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro 
485 490 495 

Leu Pro Arg Gly Trp Arg Arg Trp- Trp Cys Asp Ala Val Leu Tyr Gly 
500 505 510 

Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
515 520 525 

Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Val He Gly Ala Pro 
530 535 540 

Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Val Leu 
545 550 555 560 

Gly Pro Ser lie Ser Pro Ser His Ser Gin Arg lie Ala Gly Ser Gin 
565 570 575 

Leu Ser Ser Arg Leu Gin Tyr Phe Gly Gin Ala Leu Ser Gly Gly Gin 
580 585 590 

Asp Leu Thr Gin Asp Gly Leu Val Asn Leu Ala Val Gly Ala Arg Gly 
595 600 * 605 

GJLn Val Leu Leu Leu Arg Thr Arg Pro Val Leu Trp Val Gly Val Ser 
610 615 620 

Met Gin Phe He Pro Ala Glu He Pro Arg Ser Ala Phe Glu Cys Arg 
625 630 635 640 

Glu Gin Val Val Ser Glu Gin Thr Leu Val Gin Ser Asn He Cys Leu 
645 650 655 

Tyr He Asp Lys Arg Ser Lvs A3n Leu Leu Gly Ser Arg Asp Leu Gin 
660 665 670 
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Ser Ser Val Thr Leu Asp Leu Ala Leu Ala Pro Gly Arg Leu Ser Pro 
675 680 685 

Arg Ala He Phe Gin Glu Thr Lys Asn Arg Ser Leu Ser Arg Val Arg 
690 695 700 

Val Leu Gly Leu Lys Ala His Cys Glu Asn Phe Asn Leu Leu Leu Pro 
705 710 715 720 

Ser Cye Val Glu Asp Ser Val He Pro He He Leu Arg Leu Asn Phe 
725 730 735 

Thr Leu Val Gly Lys Pro Leu Leu Ala Phe Arg Asn Leu Arg Pro Met 
740 745 750 

Leu Ala Ala Leu Ala Gin Arg Tyr Phe Thr Ala Ser Leu Pro Phe Glu 
755 760 765 

Lys Asn Cys Gly Ala Asp His He Cys Gin Asp Asn Leu Gly He Ser 

770 - 1 775 780 

Phe Ser Phe Pro Gly Leu Lys Ssr Leu Leu Val Gly Ser Asn Leu Glu 
785 790 795 800 

Leu Asn Ala Glu Val Met Val Trp Asn Asp Gly Glu Asp Ser Tyr Gly 
805 810 815 

Thr Thr He Thr Ph«a Ser > His Fro" Ala Gly Leu Ser Tyr Arg Tyr Val 
820 825 830 

Ala Glu Gly Gin Lys Gin Gly Gin Leu Arg Ser Leu His Leu Thr Cys 
835 840 845 

Cys Ser Ala Pro Val Gly Ser Gin Gly Thr Trp Ser Thr Ser Cys Arg 
850 855 860 

lie Ash His Leu lie Phe Arg Gly Gly Ala Gin lie Thr Phe Leu Ala 
865 870 875 880 

Thr Phe Asp Vai Ser Pro Lys Ala Val Gly Leu Asp Arg Leu Leu Leu 
885 890 895 

lie Ala' Asn Val Ser Ser Giu Asn Asn lie Pro Arg Thr Ser Lys Thr 
900 905 910 

lie Phe Gin Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr He Val Val 
915 920 925 

Ser Ser His Glu Gin Phe Thr Lys Tyr Leu Asn Phe Ser Glu Ser Glu 
930 ' 935 940 

Glu Lys Glu Ser His Val Ala Met His Arg Tyr Gin Val Asn Asn Leu 
945 950 955 960 

Gly Gin Arg Asp Leu Pro Val Ser -He Asn Phe Trp Val Pro Val Glu 
965 970 975 

Leu Ash Gin Glu Ala Val Trp Met Asp Val Glu Val Ser His Pro Gin 
960 985 990 
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Asn Pro Ser Leu Arg Cys Ser Ser Glu Lys lie Ala Pro Pro Ala Ser 
995 1000 1005 

Asp Phe Leu Ala His lie Gin Lys Asn Pro Val Leu Asp Cys Ser lie 
1010 1015 1020 

Ala Gly Cys Leu Arg Phe Arg Cys Asp Val Pro Ser Phe Ser Val Gin 
1025 1030 1035 1040 

Glu Glu Leu Asp Phe Thr Leu Lys Gly Asn Leu Ser Phe Gly Trp Val 
1045 1050 1055 

Arg Gin lie Leu Gin Lys Lys Val Ser Val Val Ser Val Ala Glu He 
1060 1065 1070 

He Phe Asp Thr Ser val Tyr Ser Gin Leu Pro Gly Gin Glu Ala Phe 
1075 1080 1085 

Met Arg Ala Gin Thr He Thr Val Leu Glu Lys Tyr Lys Val His Asn 
1090 1095 1100 

Pro He Pro Leu He Val Gly Ser Ser He Gly Gly Leu Leu Leu Leu 
1105 1110 • ins H20 

Ala Leu He Thr Ala Val Leu Tyr Lys Val Gly Phe Phe Lys Arg Gin 
1125 1130 1135 

Tyr Lys Glu Met Met Glu Glu Ala Asn Gly Gin He Ala Pro Glu Asn 
1140 1145 H50 

Giy Thr Gin Thr Pro Ser Pro Pro Ser Glu Lys 
1155 1160 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Phe Asn Leu Asp Val Glu Glu Pro Met Val Phe Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO: 6: ' 

(i) SEQUENCE CHARACTERISTICS :- 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTYAAYYTGG AYGTNGARGA RCCNATGGTN TTYCA 35 
(2) INFORMATION FOR SEQ ID NO: 7: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TTCAACCTGG ACGTGGAGGA GCCCATGGTG TTCCAA 36 

(2) INFORMATION FOR SEQ ID NOsB: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) , STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TTCAACCTGG ACGTNGAASA NCCCATGGTC TTCCAA ' 36 

(2) INFORMATION FOR SEQ ID NQ:9t: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: ;. SEQ ID NO:9: \ . . . 
TTYAAYYTNG AYGTNGARGA RCC , 23 

(2) INFORMATION FOR SEQ ID NO: 10: " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pair 3 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTYAAYYTGG ACGTNGAAGA 20 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGRAANACCA TNGGYTC 17 
(2) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTGGAAGACC ATNGGYTC 18 
(2) INFORMATION FOR SEQ ID NO: 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATTAACCCTC ACTAAAG 17 
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(2) INFORMATION FOR SEQ ID NO: 14; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AATACGACTC ACTATAG 17 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Val Phe Gin Glu Xaa Gly Ala Gly Phe Gly Gin 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Leu Tyr Asp Xaa Val Ala Ala Thr Gly Leu Xaa Gin Pro lie 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE : peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Pro Leu Glu Tyr Xaa Asp Val He Pro Gin Ala Glu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 18; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS: 

Phe Gin Glu Gly Phe Ser Xaa Val Leu Xaa 
1 5 -10 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Ser Pro Thr Phe He Xaa Met Ser Gin Glu Asn Val Asp 

1 5 . -••> 10 

(2) INFORMATION FOR SEQ ID NO: 20:. J- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Leu Val Val Gly Ala Pro Leu Glu Val Val Ala Val Xaa Gin Thr 



1 



5 



1C 



15 



Arg 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Leu Asp Xaa Lys Pro Xaa Asp Thr Ala 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION :' SEQ ID NO: 22: 

Phe Gly Glu Gin Phe Ser Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
RAANCCYTCY TGRAAACTYT C 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



1 



5 
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TTCAACCTGG ACGTGGAGGA GCCCATGGTG TTCAAGAGGA TGGAGCTGGC TTTGGACAGA 60 

GCGTGGCCCA GCTTGGCGGA TCTAGACTCG TGGTGGGAGC CCCCCTGGAG GTGGTGGCGG 120 

TCAACCAAAC AGGAAGGTTG TATGACTGTG TGGCTGCCAC TGGCCTTGTC AACCCATACC 180 

CCTGCACACA CCCCCAGATG CTGTGAACAT GTCCCTGGGT CTGTCCCTGT CAGCCGCCGC 240 

CAGTCGCCCC TGGCTGCTGG CCTGTGGCCC AACCATGCAC AGAGCCTGTG GGGAGAATAT 300 

GTATGCAGAA GGCTTTTGCC TCCTGTTGGA CTCCCATCTG CAGACCATTT GGACAGTACC 360 

TGCTGCCCTA CCAGAGTGTC CAAGTCAAGA GATGGACATT GTCTTCCTGA TTGATGGTTC 420 

TGGCAGTATG AGCAAAGTGA CTTTAAACAA ATGAAGGATT TGTGAGAGCT GTGATGGGAC 480 

AGTTTGAGGG CACCCAAACC CTCTTCTCAC TCATACAGTA TCCCACCTCC CTGAAGATCC 540 

ACTTCACCTT CACGCAATTC CAGAGCAGCT GGAACCCTCT GAGCCTGGTG GATCCCATTG 600 

TCCAACTGGA CGGCCTGACA TATACAGCCA CGGGCATCCG GAAAGTGGTG GAGGAACTGT 660 

TTCATAGTAA GAATGGGGCC CGTAAAAGTG CCAAGAAGAT CCTCATTGTC ATCACAGATG 720 

GCAAAAATAC AAAGACCCCC TGGAGTACGA GGACGTATCC CCAGGCAGAG AGAGCGGATC 780 

ATCCGCTATG CCATTGGGGT GGGAGATGCT TTCTGGAAAC CCAGTGCCAA GCAGGAGCTG 840 

GACAACATTG GCTCAGAGCC GGCTCAGGAC CATGTGTTCA GGGTGGACAA CTTTGCAGCA 900 

CTCAGCAGCA TCCAGGAGCA GCTGCAGGAG AAGATCTTTG CACTCGAAGG AACCCAGTCG 960 

ACGACAAGTA GCTCTTTCCA ACATGAGATG TTCCAAGAAG GGTTCA 1006 
(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE 7 : nucleic acid 

(C) STRANDEDNESSr single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GTNTTYCARG ARGAYGG 17 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(Cf STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



BNSDOCIDt <WO 9517412A1_I_> 





WO 95/17412 



PCTAJS94/14832 



- 105 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CCACTGTCAG GATGCCCGTG 



20 



(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

AGTTACGAAT TCGCCACCAT GGCTCTACGG GTGCTTCTTC TG 42 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
{B) TYPE;, nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO:28: 
AGTTACGAAT TCGCCACCAT GACTCGGACT GTGCTTCTTC TG 42 
(2> INFORMATION FOR SEQ ID NO: 29: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid ; 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: . 
AGTTACGAAT TCGCCACCAT GACCTTCGGC ACTGTG 36 
(2) INFORMATION FOR SEQ ID NO: 3,0:. 

(i) SEQUENCE CHARACTERISTICS : . 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



TTGCTGACTG CCTGCAGTTC 



20 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTTCTGACGC GTAATGGCAT TGTAGACCTC GTCTTC 36 
(2) INFORMATION FOR SEQ ID NO:32: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ACGTATGCAG GATCCCATCA AGAGATGGAC ATCGCT 36 
(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ACTGCATGTC TCGAGGCTGA AGCCTTCTTG GGACATC 37 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 24 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TATAGACTGC TGGGTAGTCC CCAC 24 
(2) INFORMATION FOR SEQ ID NO:35 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
TGAAGATTGC GCGTAAATAA CAGA 24 

(2) INFORMATION FOR SEQ ID NO: 36; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3456 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 36s 

GGC TGG GCC CTG GCT TCC TGT CAT GGG TCT AAC CTG GAT GTG GAG GAA 48 
Gly Trp Ala Leu Ala Ser Cys His Gly Ser Asn Leu Asp Val Glu Glu 
1 5 10 15 

CCC ATC GTG TTC AGA GAG GAT GCA GCC AGC TTT GGA CAG ACT GTG GTG 96 
Pro lie Val Phe Arg Glu Asp Ala Ala Ser Phe Gly Gin Thr Val Val 
20 25 : 30 

CAG TTT GGT GGA TCT CGA CTC GTG GTG GGA GCC CCT CTG GAG GCG GTG 144 
Gin Phe Gly Gly Ser Arg Leu Val Val Gly Ala Pro Leu Glu Ala Val 
35 40 45 

GCA GTC AAC CAA ACA GGA CGG TTG TAT GAC TGT GCA CCT GCC ACT GGC 192 

Ala Val Asn Gin Thr Gly Arg Leu Tyr Asp Cys Ala Pro Ala Thr Gly 
50 55 60 
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ATG TGC CAG CCC ATC GTA CTG CGC AGT CCC CTA GAG GCA GTG AAC ATG 240 
Met Cys Gin Pro lie Val Leu Arg Ser Pro Leu Glu Ala Val Asn Met 
65 70 75 80 

TCC CTG GGC CTG TCT CTG GTG ACT GCC ACC AAT AAC GCC CAG TTG CTG 288 
Ser Leu Gly Leu Ser Leu Val Thr Ala Thr Asn Asn Ala Gin Leu Leu 
85 90 95 

GCT TGT GGT CCA ACT GCA CAG AGA GCT TGT GTG AAG AAC ATG TAT GCG 336 
Ala Cys Gly Pro Thr Ala Gin Arg Ala Cys Val Lys Asn Met Tyr Ala 
100 105 HO 

AAA GGT TCC TGC CTC CTT CTC GGC TCC AGC TTG CAG TTC ATC CAG GCA 384 
Lys Gly Ser Cys Leu Leu Leu Gly Ser Ser Leu Gin Phe lie Gin Ala 
115 120 125 

GTC CCT GCC TCC ATG CCA GAG TGT CCA AGA CAA GAG ATG GAC ATT GCT 432 
Val Pro Ala Ser Met Pro Glu Cys Pro Arg Gin Glu Met Asp lie Ala 
130 135 140 

TTC CTG ATT GAT GGT TCT GGC AGC ATT AAC CAA AGG GAC TTT GCC CAG 480 
Phe Leu lie Asp Gly Ser Gly Ser lie Asn Gin Arg Asp Phe Ala Gin 
145 150 155 160 

ATG AAG GAC TTT GTC AAA GCT TTG ATG GGA GAG TTT GCG AGC ACC AGC 528 
Met Lys Asp Phe Val Lys Ala Leu Met Gly Glu Phe Ala Ser Thr Ser 
165 170 175 

ACC TTG TTC TCC CTG ATG CAA TAC TCG AAC ATC CTG AAG ACC CAT TTT 576 
Thr Leu Phe Ser Leu Met Gin Tyr Ser Asn lie Leu Lys Thr His Phe 
180 185 190 

ACC TTC ACT GAA TTC AAG AAC ATC CTG GAC CCT CAG AGC CTG GTG GAT 624 
Thr Phe Thr Glu Phe Lys Asn He Leu Asp Pro Gin Ser Leu Val Asp 
195 200 205 

CCC ATT GTC CAG CTG CAA GGC CTG ACC TAC ACA GCC ACA GGC ATC CGG 672 
Pro He Val Gin Leu Gin Gly Leu Thr Tyr Thr Ala Thr Gly He Arg 
210 215 220 

ACA GTG ATG GAA GAG CTA TTT CAT AGC AAG AAT GGG TCC CGT AAA AGT 720 
Thr Val Met Glu Glu Leu Phe His Ser Lys Asn Gly Ser Arg Lys Ser 
225 230 235 240 

GCC AAG AAG ATC CTC CTT GTC ATC ACA GAT GGG CAG AAA TAC AGA GAC 768 
Ala Lys Lys He Leu Leu Val He Thr Asp Gly Gin Lys Tyr Arg Asp 
245 250 255 

CCC CTG GAG TAT AGT GAT GTC ATT CCC GCC GCA GAC AAA GCT GGC ATC 816 
Pro Leu Glu Tyr Ser Asp Val He Pro Ala Ala Asp Lys Ala Gly He 
260 265 270 

ATT CGT TAT GCT ATT GGG GTG GGA GAT GCC TTC CAG GAG CCC ACT GCC 864 
He Arg Tyr Ala lie Gly Val Gly Asp Ala Phe Gin Glu Pro Thr Ala 
275 280 285 

CTG AAG GAG CTG AAC ACC ATT GGC TCA GCT CCC CCA CAG GAC CAC GTG 912 
Leu Lys Glu Leu Asn Thr lie Gly Ser Aia Pro Pro Gin Asp His Val 
290 295 300 
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TTC AAG GTA GGC AAC TTT GCA GCA CTT CGC AGC ATC CAG AGG CAA CTT 960 
Phe Lys Val Gly Asn Phe Ala Ala Leu Arg Ser lie Gin Arg Gin Leu 
305 310 315 320 

CAG GAG AAA ATC TTC GCC ATT GAG GGA ACT CAA TCA AGG TCA AGT AGT 1008 
Gin Glu Lys He Phe Ala He Glu Gly Thr Gin Ser Arg Ser Ser Ser 
325 330 335 

TCC TTT CAG CAC GAG ATG TCA CAA GAA GGT TTC AGT TCA GCT CTC AfcA 1056 
Ser Phe Gin His Glu Met Ser Gin Glu Gly Phe Ser Ser A3 a Leu Thr 
340 345 350 

TCG GAT GGA CCC GTT CTG GGG GCC GYG GGA AGC TTC AGC TGG TCC GGA 1104 
Ser Asp Gly Pro Val Leu Gly Ala Xaa Gly Ser Phe Ser Trp Ser Gly 
355 360 365 

GGT. GCC TTC TTA TAT CCC CCA AAT ACG AGA CCC ACC TTT ATC AAC ATG 1152 
Gly Ala Phe Leu Tyr Pro Pro Asn Thr Arg Pro Thr Phe He Asn Met 
370 375 380 

TCT CAG GAG AAT GTG GAC ATG AGA GAC TCC TAC CTG GGT TAC TCC ACC 1200 
Ser Gin Glu Asn Val Asp Met Arg Asp Ser Tyr Leu Gly Tyr Ser Thr 
385 390 395 400 

GCA GTG GCC TTT TGG hhG GGG GTT CAC AGC CTG ATC CTG GGG GCC CCG 1248 
Ala Val Ala Phe Trp Lys Gly Val His Ser Leu He Leu Gly Ala Pro 
405 410 415 

CGT CAC CAG CAC ACG GGG AAG GTT CTC ATC TTT ACC CAG GAA GCC AGG 1296 
Arg His Gin His Thr Gly Lys Val Val He Phe Thr Gin Glu Ala Arg 
420 425 430 

CAT TGG AGG CCC AAG TCT GAA CTC AGA GGG ACA CAG ATC GGC TCC TAC 1344 
Hxs Trp Arg Pro Lys Ser Glu Val Arg Gly Thr Gin He Gly Ser Tyr 
435 440 445 

TTC GGG GCC TCT CTC TGT TCT GTG GAC CTG GAT AGA GAT GGC AGC ACY 1392 
Phe Gly Ala Ser Leu Cys Ser Val Asp Val Asp Arg Asp Gly Ser Xaa 
450 455 460 

GAC CTG GTC CTG ATC GGA GCC CCC CAT TAC TAT GAG CAG ACC CGA GGG 1440 
Asp Leu Val Leu He Gly Ala Pro His Tyr Tyr Glu Gin Thr Arg Gly 
465 470 475 480 

GGG CAG GTC TCA GTG TKC CCC GTG CCC GGT GTG AGG GGC AGG TGG CAG 1488 
Gly Gin Val Ser Val Xaa Pre Val Pro Gly Val Arg Gly Arg Trp Gin 
485 490 495 

TGT GAG GCC ACC CTC CAC GGG GAG CAG GRC CAT CCT TGG GGC CGC TTT 1536 
Cys Glu Ala. Thr Leu His Gly Glu Gin Xaa His Pro Trp Gly Arg Phe 
500 505 510 

GGG GTG GCT CTG ACA GTG CTG GGG GAC GTA AAC. GGG GAC AAT CTG GCA 1584 
Gly Val Ala Leu Thr Val Leu Gly Asp Val Asn Gly Asp Asn Leu Ala 
515 520 525 

GAC GTG GCT ATT GGT GCC CCT GGA GAG GAG GAG AGC AGA GGT GCT GTC 1632 
Asp Val Ala He Gly Ala Pro Gly Glu Glu Glu Ser Arg Gly Ala Val 
* 530 535 540 
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TAC ATA TTT CAT GGA GCC TCG AG A CTG GAG ATC ATG CCC TCA CCC AGC 1680 
Tyr lie Phe Kis Gly Ala Ser Arg Leu Glu lie Met Pro Ser Pro Ser 
545 550 555 560 

CAG CGG GTC ACT GGC TCC CAG CTC TCC CTG AGA CTG CAG TAT TTT GGG 1728 
Gin Arg Val Thr Gly Ser Gin Leu Ser Leu Arg Leu Gin Tyr Phe Gly 
565 570 575 

CAG TCA TTG AGT GGG GGT CAG GAC CTT ACA CAG GAT GGC CTG GTG GAC 1776 
Gin Ser Leu Ser Gly Gly Gin Asp Leu Thr Gin Asp Gly Leu Val Asp 
580 585 590 

CTG GCC GTG GGA GCC CAG GGG CAC GTA CTG CTG CTC AGG AGT CTG CCT 1824 
Leu Ala Val Gly Ala Gin Gly His Val Leu Leu Leu Arg Ser Leu Pro 
595 600 605 

CTG CTG AAA GTG GAG CTC TCC ATA AGA TTC GCC CCC ATG GAG GTG GCA 1872 
Leu Leu Lys. Val Glu Leu Ser lie Arg Phe Ala Pro Met Glu Val Ala 
610 615 . 620 

AAG GCT GTG TAC CAG TGC TGG GAA AGG ACT CCC ACT GTC CTC GAA GCT 1920 
Lys Ala Val Tyr Gin Cys Trp Glu Arg Thr Pro Thr Val Leu Glu Ala 
625 630 635 640 

GGA GAG GCC ACT GTC TGT CTC ACT GTC CAC AAA GGC TCA CCT GAC CTG 1968 
Gly Glu Ala Thr Val Cys I^u.Thr Val His Lys Gly Ser Fro Asp Leu 
645 650 655 

TTA GGT AAT GTC CAA GCC TCT GTC AGG TAT GAT CTG GCG TTA GAT CCG 2016 
Leu Gly Asn Val Gin Gly Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro 
660 665 670 

GGC CGC CTG ATT TCT CGT GCC ATT TTT GAT GAG ACT AAG AAC TGC ACT 2064 
Gly Arg Leu lie Ser Arg Ala He Phe Asp Glu Thr Lys Asn Cys Thr 
675 680 685 

TTG ACG GGA AGG AAG ACT CTG GGG CTT GGT GAT CAC TGC GAA ACA GTG 2112 
Leu Thr Gly Arg Lys Thr Leu Gly Leu Gly Asp His Cy3 Glu Thr Val 
690 695 700 

AAG CTG CTT TTG CCG GAC TGT GTG GAA GAT GCA GTG AGC CCT ATC ATC 2160 
Lys Leu Leu Leu Pro Asp Cys Val Glu Acp Ala Val Ser Pro He He 
705 710 715 720 

CTG CGC CTC AAC TTT TCC CTG GTG AGA GAC TCT GCT TCA CCC AGG AAC 2208 
Leu Arg Leu Asn Phe Ser Leu Val Arg Asp ser Ala Ser Pro Arg Asn 
725 , . 730 735 

CTG CAT CCT GTG CTG GCT GTG GGC TCA CAA GAC CAC ATA ACT GCT TCT 2256 
Leu His Pro Val Leu Ala Val Gly Ser Gin Asp If is lie Thr Ala Ser 
740 745 750 

CTG CCG TTT GAG AAG AAC TGT AAG CAA GAA CTC CTG TGT GAG GGG GAC 2304 
Leu Pro Phe Glu Lys Asn Cys Lys Gin Glu Leu' Leu Cys Glu Gly Asp 
755 760 765 

CTG GGC ATC AGC TTT AAC TTC TCA GGC CTG CAG GTC TTG GTG GTG GGA 2352 
Leu Gly He Ser Phe Asn Phe Ser Gly Leu Gin Val Leu Val Val Gly 
770 ' 775 780 
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CGC TCC CCA GAG CTC ACT GTG ACA GTC ACT GTC TGG AAT GAG GGT GAG 
Gly Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp Asn Glu Gly Glu 
785 790 795 800 

GAC AGC TAT GGA ACT TTA GTC AAC TTC TAC TAC CCA GCA GGG CTA TCT 
Asp Ser Tyr Gly Thr Leu Val Lys Phe Tyr Tyr Pro Ala Gly Leu Ser 
805 810 815 

TAC CGA CGG GTA ACA GGG ACT CAG CAA CCT CAT CAG TAC CCA CTA CGC 
Tyr Arg Arg Val Thr Gly Thr Gin Gin Pro Hie Gin Tyr Pro Leu Arg 
820 825 830 

TTG GCC TGT GAG GCT GAG CCC GCT GCC CAG GAG GAC CTG AGG AGC AGC 
Leu Ala Cys Glu Ala Glu Pro Ala Ala Gin Glu A3p Leu Arg Ser Ser 
835 840 845 

AGC TGT AGC ATT AAT CAC CCC ATC TTC CGA GAA GGT GCA AAG ACC ACC 
Ser Cys Ser lie Asn His Pro lie Phe Arg Glu Gly Ala Lys Thr Thr 
850 855 860 

TTC ATG ATC ACA TTC GAT GTC TCC TAC AAG GCC TTC CTA GGA GAC AGG 
Phe Met He Thr Phe Asp Val Ser Tyr Lys Ala Phe Leu Gly Asp Ara 
865 870 875 880 



TTG CTT CTG AGG GCC AAA GCC AGC AGT GAG AAT AAT AAG CCT GAT ACC 
Leu Leu Leu Arg Ala Lys -Ala Ser Ser Glu Asn Asn Lys Pro Asp Thr 
885 890 895 

AAC AAG ACT GCC TTC CAG CTG GAG CTC CCA GTG AAG TAC ACC GTC TAT 
Asn Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr 
900 905 910 

ACC CTG ATC ACT AGG CAA GAA GAT TCC ACC AAC CAT GTC AAC TTT TCA 
Thr Leu He Ser Arg Gin Glu Asp Ser Thr Asn His Val Asn Phe Ser 
915 920 925 

TCT TCC CAC GGG GGG AGA AGG CAA GAA GCC GCA CAT CGC TAT CGT GTG 
Ser Ser His Gly Gly Arg Arg Gin Glu Ala Ala His Arg Tvr Arg Val 
930 935 940 " 

AAT AAC CTG AGT CCA CTG AAG CTG GCC GTC AGA GTT AAC TTC TGG GTC 
Asn Asn Leu Ser Pro Leu Lys Leu Ala Val Arg Vai Asn Phe Trp Val 
9 « 950 955 960 

CCT GTC CTT CTG AAC GGT GTG GCT GTG TGG GAC GTG ACT CTG AGC AGC 
Pro Val Leu Leu Asn Gly Val Ala Val Trp Asp Val Thr Leu Ser Ser 
965 970 975 

CCA GCA CAG GGT GTC TCC TGC GTG TCC CAG ATG AAA CCT CCT CAG AAT 
Pro Ala Gin Gly Val Ser Cys Vai Ser Gin Met Lys Pro Pro Glh Asn 
980 985 990 

CCC GAC TTT CTG ACC CAG ATT CAG AGA CGT* TCT GTG CTG GAC TGC TCC 
Pro Asp Phe Leu Thr Gin lie Gin Arg Arg Ser Val Leu Asp Cys Ser 
995 1000 1005 

ATT GCT GAC TGC CTG CAC TCC CGC TGT GAC ATC CCC TCC TTG GAC ATC 
He Ala Asp Cys Leu His Ser Arg Cys Asp He Pro Ser Leu Asp He 
1010 1015 1020 
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CAC GAT GAA CTT GAC TTC ATT CTG AGG GGC AAC CTC AGC TTC GGC TGG 3120 
Gin Asp Glu Leu Asp Phe lie Leu Arg Gly Aen Leu Ser Phe Gly Trp 
1025 1030 1035 1040 

GTC AGT CAG ACA TTG CAG GAA AAG GTG TTG CTT GTG AGT GAG GCT GAA 3168 
Val Ser Gin Thr Leu Gin Glu Lys Val Leu Leu Val Ser Glu Ala Glu 
1045 1050 1055 

ATC ACT TTC GAC ACA TCT GTG TAC TCC CAG CTG CCA GGA CAG GAG GCA 3216 
lie Thr Phe Asp Thr Ser Val Tyr ser Gin Leu Pro Gly Gin Glu Ala 
1060 1065 1070 

TTT CTG AGA GCC CAG GTG GAG ACA ACG TTA GAA GAA TAC GTG GTC TAT 3264 
Phe Leu Arg Ala Gin Val Glu Thr Thr Leu Glu Glu Tyr Val Val Tyr 
1075 1080 . 1085 

CAG CCC ATC TTC CTC GTG GCG GGC AGC TCG GTG GGA GGT CTG CTG TTA 3312 
Glu Pro lie Phe Leu Val Ala Gly Ser Ser Val Gly Gly Leu Leu Leu 
1090 1095 1100 

CTG GCT CTC ATC ACA GTG GTA CTG TAC AAG CTT GGC TYC TYC AAA CGT 3360 
Leu Ala Leu lie Thr Val Val Leu Tyr Lys Leu Gly Xaa Xaa Lys Arg 
1105 1110 1115 : 1120 

CAG TAC AAA GAA ATG CTG GAC GGC AAG GCT GCA GAT CCT GTC ACA GCC 3408 
Gin Tyr Lya Glu Met Leu Asp Gly Lys Ala Ala Asp Pro Val Thr Ala 
1125 1130 H35 

GGC CAG GCA GAT TTC GGC TGT GAG ACT CCT CCA TAT CTC GTG AGC TAGGAATCCA 3463 
Gly Gin Ala Asp Phe Gly Cys Glu Thr Pro Pro Tyr Leu Val Ser 
1140 1145 1150 

CTCTCCTCCC TATCTCTGNA ATGAAGATTG GTCCTGCCTA TGAGTCTACT GGCATGGGAA 3523 



CGAGT 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37j 

Gly Trp Ala Leu Ala Ser Cye His Gly Ser. Asn Leu Asp Val Glu Glu 
1 5 10 15 

Pro He Val Phe Arg Glu Asp Ala Ala Ser Phe Gly Gin Thr Val Val 
20 25 30 

Girt Phe Gly Gly Ser Arg Leu Val Val Gly Ala Pro Leu Glu Ala Val 
35 40 45 

Ala Val Asn Gin Thr Gly Arg, Leu Tyr Asp Cys Ala Pro Ala Thr Gly 
50 55 60 



3528 
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Met Cys Gin Pro He Val Leu Arg Ser Pro Leu Glu Ala Val Asn Met 
65 70 75 80 

Ser Leu Gly Leu Ser Leu Val Thr Ala Thr Asn Asn Ala Gin Leu Leu 
85 90 95 

Ala Cys Gly Pro Thr Ala Gin Arg Ala Cys Val Lys Asn Met Tyr Ala 
100 105 110 

Lys Gly Ser Cys Leu Leu Leu Gly Ser Ser Leu Gin Phe He Gin Ala 
115 120 125 

Val Pro Ala Ser Met Pro Glu Cys Pro Arg Gin Glu Met Asp He Ala 
130 135 140 

Phe Leu He Asp Gly Ser Gly Ser He Asn Gin Arg Asp Phe Ala Gin 
145 150 155 160 

Met Lys Asp Phe Val Lys Ala Leu Met Gly Glu Phe Ala Ser Thr Ser 
165 170 175 

Thr Leu Phe Ser Leu Met Gin Tyr Ser Asn He Leu Lys Thr His Phe 
180 185 190 

Thr Phe Thr Glu Phe Lys Asn He Leu Asp Pro Gin Ser Leu Val Asp 
195 200 205 

Pro He Val Gin Leu Gin Gly Leu Thr Tyr Thr Ala Thr Gly He Arg 
210 215 220 

Thr Val Met Glu Glu Leu Phe His Ser Lys Asn Gly Ser Arg Lys Ser 
225 230 235 240 

Ala Lys Lys He Leu Leu Val He Thr Asp Gly Gin Lys Tyr Arg Asp 
245 250 255 

Pro Leu Glu Tyr Ser Asp Val He Pro Ala Ala Asp Lys Ala Gly lie 
260 265 270 

He Arg Tyr Ala He Gly Val Gly Asp Ala Phe Gin Glu Pro Thr Ala 
275 280 285 

Leu Lys Glu Leu Asn Thr He Gly Ser Ala Pro Pro Gin Asp His Val 
290 295 300 

Phe Lys Val Gly Asn Phe Ala Ala Leu Arg Ser He Gin Arg Gin Leu 
305 310 315 320 

Gin Glu Lye lie Phe Ala He Glu Gly Thr Gin Ser Arg Ser Ser Ser 
325 V 330 335 

Ser Phe Gin His Glu Met Ser Gin Glu Gly Phe Ser Ser Ala Leu Thr 
340 345 350 

Ser Asp Gly Pro Val Leu Gly Ala Xaa Gly Ser Phe Ser Trp Ser Gly 
355 360 365 

Gly Ala Phe Leu Tyr Pro Pro Asn Thr Arg Pro Thr Phe He Asn Met 
370 375 380 



BNSDOCID: <WO_9517412A1_l_> 



WO 95/17412 PCIYUS94/14832 



-114- 

Ser Gin Glu Asn Val Asp Met Arg Asp Ser Tyr Leu Gly Tyr Ser Thr 
385 390 395 400 

Ala Val Ala Phe Trp Lys Gly Val His Ser Leu lie Leu Gly Ala Pro 
405 410 415 

Arg His Gin His Thr Gly Lys Val Val He Phe Thr Gin Glu Ala Arg 
420 425 430 

His Trp Arg Pro Lys Ser Glu Val Arg Gly Thr Gin He Gly Ser Tyr 
435 440 445 

Phe Gly Ala Ser Leu Cys Ser Val Asp Val Asp Arg Asp Gly Ser Xaa 
450 455 460 

Asp Leu Val Leu lie Gly Ala Pro Hie Tyr Tyr Glu Gin Thr Arg Gly 
465 470 475 480 

Gly Gin Val Ser Val Xaa Pro Val Px-o Gly Val Arg Gly Arg Trp Gin 
485 490 495 

Cys Glu Ala Thr Leu Kis Gly Glu Glh Xa£ His Pro Trp Gly Arg Phe 
500 505 510 

Gly Val Ala Leu Thr Val Leu Gly Asp Val Asn Gly Asp Asn Leu Ala 
515 520 * 525 

Asp Val Ala He Gly Ala Fro Gly Glu Glu Glu Ser Arg Gly Ala Val 
530 535 540 

Tyr He Phe His Gly Ala Ser Arg Leu Glu He Met Pro Ser Pro Ser 
545 550 555 560 

Gin Arg Val Thr Gly Ser Gin Leu Ser Leu Arg Leu Gin Tyr Phe Gly 
565 570 575 

Gin Ssr Leu Ser Gly Gly Gin. Asp Leu Thr Gin Asp Gly Leu Val Asp 
580 585 590 

Leu Ala Val Gly Ala Gin Gly His Val Leu Leu Leu Arg Ser Leu Pro 
595 600 605 

Leu Leu Lys Val Glu Leu Ser lie Arg Phe Ala Pro Met Glu Val Ala 
610 615 620 

Lys Ala Val Tyr Gin Cys Trp Glu Arg Thr Pro Thr Val Leu Glu Ala 
625 630 635 640 

Gly Glu AXa Thr Val Cys Leu Thr Val His Lys Gly Ser Pro Asp Leu 
645 650 655 

Leu .Gly. Asn Val. Gin Gly Ser Val Arg. Tyr Asp Leu Ala Leu Asp Pro 
660 665 670 

Gly Arg Leu He Ser Arg Ala He Phe Asp Glu Thr Lys Asn' Cys Thr 
675 680 685 

Leu Thr Gly Arg Lys Thr Leu Gly Leu Gly Asp His Cys Glu Thr Val 
690 695 700 
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Lys Leu Leu Leu Pro Asp Cye Val Glu Asp Ala Val Ser Pro lie He 
705 710 715 720 

Leu Arg Leu Asn Phe Ser Leu Val Arg Asp ser Ala ser Pro Arg Asn 
725 730 735 

Leu His Pro Val Leu Ala Val Gly Ser Gin Asp His lie Thr Ala Ser 
740 745 750 

Leu Pro Phe Glu Lys Asn Cye Lva Gin Glu Leu Leu Cys Glu Gly Asp 
755 760 765 

Leu Gly He Ser Phe Asn Phe Ser Gly Leu Gin Vai Leu Val Val Gly 
770 775 780 

Gly Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp Asn Glu Gly Glu 
785 790 795 800 

Asp Ser Tyr Gly Thr Leu Val Lys Phe Tyr Tyr Pro Ala Gly Leu Ser 
805 > 810 815 

Tyr Arg Arg Val Thr Gly: Thr Gin Gin Pro His Gin Tyr Pro Leu Arg 
820 n; 825 830 

Leu Ala Cvs Glu Ala Glu Pro Ala Ala Gin Glu Asp Leu Arg Ser Ser 
835 840 845 

Ser Cys Ser He Asn His Pro 1 He Phe Arg Glu Gly Ala Lys Thr Thr 
850 855 860 

Phe Met He Thr Phe Asp Val. Ser Tyr Lys Ala Phe Leu Gly Asp Arg 
865 870 875 880 

Leu Leu Leu Arg Ala Lys Ala Ser Ser Glu Asn Asn Lys Pro Asp Thr 
885 890 895 

Asn Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr 
900 905 910 

Thr Leu He Ser. Arg Gin Glu Asp Ser Thr Asn His Val Asn Phe Ser 
915 920 925 

Ser Ser His Gly Gly Arg Arg Gin Glu Ala Ala His Arg Tyr Arg Val 
930 935 940 

Asn Asn Leu Ser Pro Leu Lys Leu Ala Val Arg Val Asn Phe Trp Val 
945 950 * 955 960 

Pro Val LeuLeu Asn Gly Val Ala Val Trp Asp Val Thr* Lsu Ser Ser 
965 970 975 

Pro Ala Gin Gly Val Ser Cys Val Ser Gin Met Lys Pro Pro Gin Asn 
980 985 990 

Pro Asp Phe Leu Thr Gin lie Gin Arg Arg Ser Val Leu Asp Cys Ser 
995 1000 1005 

He Ala Asp Cys Leu His Ser Arg Cys Asp lie Pro Ser Leu Asp He 
1010 1015 1020 
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Gln Asp Glu Leu Asp Phe He Leu Arg Gly Asn Leu Ser Phe Gly Trp 
1025 1030 1035 1040 

Val Ser Gin Thr Leu Gin Glu Lys Val Leu Leu Val Ser Glu Ala Glu 
1045 1050 1055 

He Thr Phe Asp Thr Ser Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala 
1060 1065 1070 

Phe Leu Arg Ala Gin Val Glu Thr Thr Leu Glu Glu Tyr Val Val Tyr 
1075 1080 1085 

Glu Pro He Phe Leu Val Ala Gly Ser Ser Val Gly Gly Leu Leu Leu 
1090 1095 1100 

Leu Ala Leu He Thr Val Val Leu Tyr Lys Leu Gly Xaa Xaa Lys Arg 
1105 1110 1115 1120 

Gin Tyr Lys Glu Met Leu Asp Gly Lys Ala Ala Asp Pro Val Thr Ala 
1125 1130 1135 

Gly Gin Ala Asp Phe Gly Cys Glu Thr Pro Pro Tyr Leu Val Ser 
1140 1145 1150 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : singls 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GTCCAAGCTG TCATGGGCCA G 21 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: ' " 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GTCCAGCAGA CTGAAGAGCA CGG 23 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

{x t, SEQUENCE DESCRIPTION : SEQ ID NO: 40: ^ 
TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 41: 

(il SEQUENCE CHARACTERISTICS: 
1 ' it) LENGTH : 19 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA . ; - 

( xi) SEQUENCE DESCRIPTION: SEQ ID HO,41. ^ 
GGAAACAGCT ATGACCATG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

1 ' (A) LENGTH: 22 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

( xi) SEQUENCE DESCRIPTION : SEQ ID NO:42: ^ 
GGACATGTTC ACTGCCTCTA GG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
( ' (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GGCGGACAGT CAGACGACTG TCCTG 
(2) INFORMATION FOR SEQ ID NO: 44: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CTGGTTCGGC CCACCTCTGA AGGTTCCAGA ATCGATAG 38 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 52. .3519 ' . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GCTTTCTGAA GGTTCCAGAA TCGATAGTGA ATTCGTGGGC ACTGCTCAGA . T ATG GTC 57 

Met Val 
1 

CGT GQA GTT GTG ATC CTC CTG TGT GGC TGG GCC CTG GCT TCC TGT CAT 105 
Arg Gly Val Val He Leu Leu Cys Gly Trp Ala Leu Ala Ser Cys His 
5 10 is 

GGG TCT AAC CTG GAT GTG GAG AAG CCC GTC GTG TTC AAA GAG GAT GCA 153 
Gly Ser Asn Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu Asc Ala 
20 25 30 

GCC AGC TTC GGA CAG ACT GTG GTG CAG ■ TTT GGT GGA TCT CGA CTC GTG 201 
Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg Leu Val 
35 40 45 50 

GTG GGA GCC CCT CTG GAG GCG GTG GCA GTC AAC CAA ACA GGA CAG TCG 249 
Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly Gin Ser 
55 60 65 

TCT GAC TGT CCG CCT GCC ACT GGC GTG TGC CAG CCC ATC TTA CTC CAC 297 
Ser Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro- He Leu Leu His 
70 75 80 

ATT CCC CTA GAG GCA GTG AAC ATG TCC CTG GGC CTG TCT CTG GTG GCT 345 
He Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Val Ala 
85 90 95 
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GAC ACC AAT AAC TCC CAG TTG CTG GCT TGT GGT CCA ACT GCA CAG AGA 393 
Asp Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala Gin Arg 
100 1° 5 110 

GCT TGT GCA AAG AAC ATG TAT GCA AAA GGT TCC TGC CTC CTT CTG GGC 441 
Ala Cys Ala Lys Asn Met Tyr Ala Lye Gly Ser Cys Leu Leu Leu Gly 
115 120 125 130 

TCC AGC TTG CAG TTC ATC CAG GCA ATC CCT GCT ACC ATG CCA GAG TGT 489 
Ser Ser Leu Gin Phe lie Gin Ala lie Pro Ala Thr Met Pro Glu Cys 
135 140 145 

CCA GCA CAA GAG ATG CAC ATT GCT TTC CTG ATT GAT GGC TCC GGC AGC 537 
Pro Gly Gin Clu Met Asp lie Ala Phe Leu lie Asp Gly Ser Gly Ser 
150 155 160 

ATT GAT CAA ACT GAC TTT ACC CAG ATG AAG GAC TTC GTC AAA GCT TTG 585 
lie Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys Ala Leu 
165 170 175 

ATG GGC CAG TTG GCG AGC ACC AGC ACC TCG TTC TCC CTG ATG CAA TAC 633 
Met Gly Gin Leu Ala Ser Thr SerThr Ser Phe Ser Leu Met Gin Tyr 
180 185 190 

TCA AAC ATC CTC AAG ACT CAT TTT ACC TTC ACG GAA TTC AAG AGC AGC 681 
Ser -Asn lie Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys Ser Ser 
195 200 205 210 

CTG AGC CCT CAC AGC CTG GTG &AT GCC ATC GTC CAG CTC CAA GGC CTG 729 
Leu Ser Pro Gin Ser Leu Val Asp Ala lie Val Gin Leu Gin Gly Leu 
215 220 225 

ACG TAC ACA GCC TCC GGC ATC CAG AAA GTG GTG AAA GAG CTA TTT CAT 777 
Thr Tyr Thr Ala Ser Gly lie Gin Lys Val Val Lys Giu Leu Phe His 
230 235 240 

AGC AAG AAT GGG GCC CCA AAA AGT GCC AAG AAG ATA CTA ATT GTC ATC 825 
Ser Lys Asn Gly Aia Arg Lys Ser Ala Lys Lys lie Leu lie Val lie 
245 250 255 

ACA GAT GGG CAG AAA TTC AGA GAC CCC CTG GAG TAT AGA CAT GTC ATC 873 

Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg Kis Val lie 

260 ; . 265 270 : : 

CCT GAA GCA GAG AAA GCT GGG ATC ATT CGC TAT GCT ATA GGG GTG GGA 921 
Pro Giu Aia Glu Lys Ala Gly lie lie Arg Tyr Ala lie Gly Val Gly 
275 280 285 290 

GAT GCC TTC CGG GAA CCC ACT GCC CTA CAG GAG CTG AAC ACC ATT GGC 969 
Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr lie Gly 

.295 . 300 305 

TCA GCT CCC TCG CAG GAC CAC GTG TTC AAG GTG GGC AAT TTT GTA GCA 1017 
Ser Ala Pro Ser Gin Asp His Val Phe Ly3 Val Gly Asn Phe Val Ala 
310 - 315 320 

CTT CGC AGC ATC CAG CGG CAA ATT CAG GAG AAA ATC TTT GCC ATT GAA 1065 
Leu Arg Ser lie Gin Arg Gin lie Gin Glu Lys lie Phs Ala lie Glu 

325 . 330 335 
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GGA ACC GAA TCA ACG TCA AGT AGT TCC TTT CAG CAC GAG ATG TCA CAA 1113 
Gly Thr Glu Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met Ser Gin 
340 345 350 

GAA GGT TTC AGC TCA GCT CTC TCA ATG GAT GGA CCA GTT CTG GGG GCT 1161 
Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu Gly Ala 
355 360 365 370 

GTG GGA GGC TTC AGC TGG TCT GGA GGT GCC TTC TTG TAC CCC TCA AAT 1209 
Val Gly Gly Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Ser Asn 
375 380 385 

ATG AG A TCC ACC TTC ATC AAC ATG TCT CAG GAG AAC GAG GAT ATG AGG 1257 
Met Arg Ser Thr Phe lie Asn Met Ser Gin Glu Asn Glu Asp Met Arg 
390 395 400 

GAC GCT TAC CTC GGT TAC TCC ACC GCA CTG GCC TTT TGG AAG GGG GTC 1305 
Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys Gly Val 
405 410 415 

CAC AGC CTG ATC CTG GGG GCC CCT CGC CAC CAG CAC ACG GGG AAG GTT 1353 
His Ser Leu lie Leu Gly Ala Pro Arg His Gin His Thr Gly Lys Val 
420 425 430 

GTC ATC TTT ACC CAG GAA TCC AGG CAC TGG AGG CCC AAG TCT GAA GTC 1401 
Val lie Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser Glu Val 
435 440 445 450 

AGA GGG ACA CAG ATC GGC TCC TAC TTT GGG GCA TCT CTC TGT TCT GTG 1449 
Arg Gly Thr Gin lie Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val 
455 460 465 

GAC ATG GAT AGA GAT GGC AGC ACT GAC CTG GTC CTG ATT GGA GTC CCC 1497 
Asp Met Asp Arg Asp Gly Ser Thr A3p Leu Val Leu lie Gly Val Pro 
470 475 480 

CAT TAC TAT GAG CAC ACC CCA GGG GGG CAG GTG TOG GTC TGC CCC ATG 1545 
His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys Pro Met 
485 490 495 

CCT GGT GTG AGG AGC AGG TGG CAT TGT GGG ACC ACC CTC CAT GGG GAG 1593 
Pro Gly Val Arg Ser Arg Trp His : Cys Gly Thr Thr Leu His Gly Glu 
500 505 510 

CAG GCC CAT CCT TOG GGC CGC TTT GGG GOG GCT CTG ACA GTG CTA GGG 1641 
Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu Gly 
515 520 525 530 

GAC GTG AAT GGG GAC AGT CTG GCG GAT GTG GCT ATT GGT GCA CCC GGA 1689 
Asp Val Aon Gly Asp Ser Leu Ala Asp Val Ala He Gly Ala Pro Gly 
535 540 545 

GAC GAG GAG AAC AGA GGT GCT GTC TAC ATA TTT CAT GGA GCC TCG AGA 1737 
Glu Glu Glu Asn Arg Gly Ala Val Tyr lie Phe His Gly Ala Ser Arg 
550 555 560 

CAG GAC ATC GCT CCC TCG CCT AGC CAG CGG GTC ACT GGC TCC CAG CTC 1785 
Gin Asp He Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser Gin Leu 
565 570 575 
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TTC CTG AGG CTC CAA TAT TTT GGG CAG TCA TTA AGT GGG GGT CAG GAC 1833 
Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly Gin Asp 
580 585 590 

CTT ACA CAG GAT GGC CTG GTG GAC CTG GCC GTG GGA GCC CAG GGG CAC 1881 
Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin Gly His 
595 600 605 610 

GTG CTG CTG CTT AGG AGT CTG CCT TTG CTG AAA GTG GGG ATC TCC ATT 1929 
Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Gly He Ser lie 
615 620 625 

AGA TTT GCC CCC TCA GAG GTG GCA A AG ACT GTG TAC CAG TGC TGG GGA 1977 
Arg Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys Trp Gly 
630 635 640 

AGG ACT CCC ACT GTC CTC GAA, GCT GGA GAG GCC ACC GTC TGT CTC ACT 2025 
Arq Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys Leu Thr 
645 650 655 

GTC CGC AAA GGT TCA CCT GAC CTG TTA, GGT GAT GTC CAA AGC TCT GTC 2073 
Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser Ser Val 
660 665 670 

AGG TAT GAT CTG. GCG . TTG \ GAT CCG GGC CGT CTG ATT. TCT CGT GCC ATT 2121 
Arg Tyr Asp Leu Ala Lea, Asp Pro Gly Arg Leu lie Ser Arg Ala He 
675 680 685 690 

TTT GAT GAG ACG AAG AAC TGC ACT TTG ACC CGA AGG AAG ACT CTG GGG 2169 
Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Arg Arg Lys Thr Leu Gly 
695^ 700 705 

CTT GGT GAT CAC TGC GAA. ACA ATG AAG CTG CTT TTG CCA GAC TGT GTG 2217 
Leu Gly A3p His Cys Glu, Thr Met Lys Leu Leu Leu Pro Asp Cys Val 
710 715 720 

GAG GAT GCA GTG ACC CCT; ATC ATC CTG CGC CTT AAC TTA TCC CTG CCA 2265 
Glu Asp Ala Val Thr Pro lie lie Leu Arg Leu Asn. Leu Ser Leu Ala 
725 730 735 

GGG GAC TCT GCT CCA TCC AGG AAC CTT CGT CCT GTG CTG GCT GTG GGC 2313 
Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala Val Gly 
740 745 750 

TCA CAA GAC CAT GTA ACA GCT TCT TTC CCG TTT GAG AAG AAC TGT GAG 2361 
Ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn Cys Glu 
755 760 765 770 

GGG AAC CTG GGC GTC AGC TTC AAC TTC TCA GGC CTG CAG GTC TTG GAG 2409 
Gly .Asn Leu Gly Val Ser Phe. Asn Phe Ser Gly Leu Gin Val Leu Glu 
775 780 785 

GTA GGA AGC TCC CCA GAG CTC ACT GTG ACA GTA ACA .CTT TGG AAT GAG 2457 

Val Gly Ser Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp Asn Glu 
790 795 800 

GGT GAG GAC AGC TAT GGA ACC TTA ATC ..AAG TTC TAC TAC CCA GCA GAG 2505 
Gly Glu Asp Ser Tyr Gly Thr Leu He Lys Phe Tyr Tyr Pro Ala Glu 
805 810 315 
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CTA TCT TAC CGA CGG GTG ACA AGA GCC CAG CAA CCT CAT CCG TAC CCA 
Leu Ser Tyr Arg Arg Val Thr Arg Ala Gin Gin Pro His Pro Tyr Pro 
820 825 830 



2553 



CTA CGC CTG GCA TGT GAG GCT GAG CCC ACG GGC CAG GAG AGC CTG AGG 
Leu Arg Leu Ala Cys Glu Ala Glu Pro Thr Gly Gin Glu Ser Leu Arg 
835 840 845 850 



2601 



AGC AGC AGC TGT AGC ATC AAT CAC CCC ATC TTC CGA GAA GGT GCC AAG 
Ser Ser Ser Cya Ser lie Asn His Pro lie Phe Arg Glu Gly Ala Lya 
855 860 865 



2649 



GCC ACC TTC ATG ATC ACA TTT GAT GTC TCC TAC AAG GCC TTC CTG GGA 
Ala Thr Phe Met He Thr Phe Asp Val Ser Tyr Lys Ala Phe Leu Gly 
870 875 880 



2697 



GAC ACG TTC CTT CTC AGC GCC AGC GCA AGC AGT GAG AAT AAT AAG CCT 2745 
Asp Arg Leu Leu Leu Arg Ala Ser Ala Ser Ser Glu Asn Asn Lys Pro 
885 890 895 

GAA ACC ACC AAC ACT CCC TTC CAG CTG GAG CTT CCG GTG AAG TAC ACG 2793 
Glu Thr Ser Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys Tyr Thr 
900 90S 910 

GTC TAT ACC GTG ATC ACT AGG CAG GAA GAT TCT ACC AAG CAT TTC AAC 2841 
Val Tyr Thr Val He Ser Arg Gin Glu Asp Ser Thr Lys His Phe Asn 
915 920 925 930 

TTC TCA TCT TCC CAC CGG GAG AGA CAG AAA GAG GCC GAA CAT CGA TAT 2889 
Phe Ser ser Ser His Gly Glu Arg Gin Lye Glu Ala Glu His Arg Tyr 
935 940 945 

CGT GTG AAT AAC CTG AGT CCA TTG ACG CTG GCC ATC AGC GTT AAC TTC 2937 
Arg Val Asn Asn Leu Ser Pro Leu Thr Leu Ala lie Ser Val Asn Phe 
950 955 960 

TGG GTC CCC ATC CTT CTG AAT GGT GTG GCC GTG TGG GAT GTG ACT CTG 2985 
Trp Val Pro He Leu Leu Asn Gly Val Ala Val Trp Asp Val Thr Leu 

965 970 . 975 

AGG AGC CCA GCA CAG GGT GTC TCC TGT GTG TCA CAG AGG GAA CCT CCT 3033 
Arg Ser Pro Ala Gin Gly Val Ser Cys Val Ser Gin Arg Glu Pro Pro 
980 985 990 

CAA CAT TCC GAC CTT CTG ACC CAG ATC CAA GGA CGC TCT GTG CTG GAC 3081 
Gin His Ser Asp Leu Leu Thr Gin He Gin Gly Arg Ser Val Leu Asp 
995 1000 1005 1010 

TGC GCC ATC GCC GAC TGC CTG CAC CTC CGC TGT GAC ATC CCC TCC TTG 3129 
Cys Ala He Ala Asp Cys Leu. His Leu Arg Cys Asp He Pro Ser Leu 
1015 1020 1025 

GGC ACC CTG GAT GAG CTT GAC TTC ATT CTG AAG GGC AAC CTC AGC TTC 3177 
Gly Thr Leu Asp Glu Lieu Asp Phe He Leu Lys Gly Asn Leu Ser Phe 
1030 „ 1035 1040 

GGC TGG ATC AGT CAG ACA TTG CAG AAA AAG GTG TTG CTC CTG AGT GAG 3225 
Gly Trp He Ser Gin Thr Leu Gin Lys Lye Val Leu Leu Leu Ser Glu 
1045 . 1050 1055 
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OCT GAA ATC ACA TTC AAC ACA TCT GTG TAT TCC CAG CTG CCG GGA CAG 3273 
Ala Glu lie Thr Phe Asn Thr Ser Val Tyr Ser Gin Leu Pro Gly Gin 
1060 1065 1070 

GAG GCA TTT CTG AGA GCC CAG GTG TCA ACG ATG CTA GAA GAA TAC GTG 3321 
Glu Ala Phe Leu Arg Ala Gin Val Ser Thr Met Leu Glu Glu Tyr Val 
1075 1080 1085 1090 

GTC TAT GAG CCC GTC TTC CTC ATG GTG TTC AGC TCA GTG GGA GGT CTG 3369 
Val Tyr Glu Pro Val Phe Leu Met Val Phe Ser Ser Val Gly Gly Leu 
1095 1100 1105 

CTG TTA CTG GCT CTC ATC ACT GTG GCG CTG TAC AAG CTT GGC TTC TTC 3417 
Leu Leu Leu Ala Leu He Thr Val Ala Leu Tyr Lys Leu Gly Phe Phe 
1110 1115 1120 

AAA CGT CAG TAT AAA GAG ATG CTG GAT CTA CCA TCT GCA GAT CCT GAC 3465 
Lys Arg Gin Tyr Lys Glu Met Leu Asp Leu Pro Ser Ala Asp Pro Asp 
1125 1130 1135 

CCA CCC GGC CAG GCA GAT TCC AAC CAT GAG ACT CCT CCA CAT CTC ACG 3513 
Pro Ala Gly Gin Ala Asp Ser Ash His Glu Thr Pro Pro His Leu Thr 
1140 1145 1150 

TCC TAG . * \t • ' ' 3519 

Ser 

1155 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 amino acids 
{ B) .TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Met Val Arg Gly Var Val He Leu Leu eye Gly Tzp Ala Leu Ala Ser 
1 -5 10 15 

Cys His Gly Ser Asn Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu 
20 25 30 ! 

Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg 
35 40 45 

Leu Val Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr iGly 
50 55 60 

Gin Ser Ser. Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro lie Leu 

65 ; 70 : 75 . 80 

Leu His lie Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 

• . 85 - 90 95 

Val Ala Asp Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 110 
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Gin Arg Ala Cys Ala Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu 
115 120 125 

Leu Gly Ser Ser Leu. Gin Phe lie Gin. Ala lie Pro Ala Thr Met Pro 
130 135 140 

Glu Cys Pro Gly Gin Glu Met Asp lie Ala Phe Leu lie Asp Gly Ser 
145 150 155 160 

Gly Ser lie Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys 
165 170 175 

Ala Leu Met Gly Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met 
180 185 190 

Gin Tyr Ser Asn lie Leu Lys Thr Kis Phe Thr Phe Thr Glu Phe Lys 
195 200 205 

Ser Ser Leu Ser Pro Gin Ser Leu Val Asp Ala lie Val Gin Leu Gin 
210 215 220 

Gly Leu Thr Tyr Thr Ala Ser Gly lie Gin Lys Val Val Lys Glu Leu 
225 230 235 240 

Phe Kis Ser Lys Asn Gly Ala. Arg Lys Ssr Ala Lys Lys He Leu He 
245 250 255 

Val He Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg His 
260 265 270 

Val lie Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Gly 
275 280 235 

Val Gly Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr 
2 90 295 300 

He Gly Ser 11a Pro Ser Gin Asp His Val . Phe Lys Val Gly Asn Phe 
305 310 315 320 

Val Ala Leu Arg Ser . He Gin Arg Gin lie .Gin Glu Lys He Phe Ala 
325 330 335 

lie Glu Gly Thr Glu Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met 
340 345 350 

Ser Gin Glu Gly Fhe ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu 
355 360 365 

Gly Ala Val Gly Gly Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro 
370 375 380 

Ser Asn Met Arg Ser Thr Phe He Asn Met Ser Gin Glu Asn Glu Asp 
385 390 . .395 400 

Met Arg Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys 
405 410 415 

Gly Val His Ser Leu He Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 
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Lys Val Val lie Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser 
.435 440 445 

Glu Val Arg Gly Thr Gin lie Gly Ser Tyr Phe Gly Ala Ser Leu Cys 
450 455 460 

Ser Val Asp Het Asp Arg Asp Gly Ser Thr Asp Leu Val Leu lie Gly 
465 470 475 480 

Val Pro His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys 
485 490 495 

Pro Met Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His 
500 505 510 

Glv Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val 
515 520 525 

Leu Gly Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala lie Gly Ala 
530 535 540 

Pro Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr lie Phe His Gly Ala 
545 . 550 555 560 

Ser Arc* Gin Asp He. Ala; Pro: Ser' Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

Gin Leu Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
580 585 590 

Gin Asp Leu Thr Gin. Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 

Gly His. Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Gly He 
610 615 620 

Ser He Arg Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys 
625 630 635 640 

Trp Gly Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys 
645- : 650 655 

Leu. Thr Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser 
660 665 670 

Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu lie Ser Arg 
675 680 685 

Ala lie Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Arg Arg Lys Thr 
690 695 700 

Leu Gly Leu Gly Asp His Cys Glu Thr Met Lys Leu Leu Leu Pro Asp 
705 710 *. 715 720 

Cys Val Glu Asp Ala Val Thr Pro lie He Leu Arg Leu Asn Leu Ser 
725 : 730 735 

Leu Ala Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala 
740 745 750 
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Val Gly Ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn 
755 760 765 

Cys Glu Gly Asn Leu Gly Val Ser Phe Asn Phe Ser Gly Leu Gin Val 
770 775 780 

Leu Glu Val Gly Ser Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp 
785 790 795 800 

Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu lie Lys Phe Tyr Tyr Pro 
805 810 815 

Ala Glu Leu Ser Tyr Arg Arg Val Thr Arg Ala Gin Gin Pro His Pro 
820 825 830 

Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu Pro Thr Gly Gin Glu Ser 
835 840 845 

Leu Arg Ser Ser Ser Cys Ser lie Asn His Pro He Fhe Arg Glu Gly 
850 855 860 

Ala Lys Ala Thr Phe Met He Thr Phe Asp Val Ser Tyr Lys Ala Phe 
865 870 . r 875 880 

Leu Gly Asp Arg Leu Leu Leu Arg Ala Ser Ala Ser Ser Glu Asn Asn 
885 890 895 

Lys Pro Glu Thr Ser Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys 
900 905 910 

Tyr Thr Val Tyr Thr Val He Sar Arg Gin Glu Aop Ser Thr Lys His 
915 920 925 

Phe Asn Phe Ser Ser Ser His Gly Glu Arg Gin Lys Glu Ala Glu His 
930 935 940 

Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu Thr Leu Ala He Ser Val 
945 950 955 960 

Asn Phe Trp Val Pro He Leu Leu Asn Gly Val Ala Val Trp Asp Val 
965 970 975 

Thr Leu Arg Ser Pro Ala Gin Gly Val Ser Cys Val Ser Gin Arg Glu 
980 985 990 

Pro Pro Gin Hie Ser Asp Leu Leu Thr Gin He Gin Gly Arg Ser Val 
995 , 1000 1005 

Leu Asp Cys Ala He Ala Asp Cys Leu His Leu Arg Cys Asp He Pro 
1010 1015 1020 

Ser Leu Gly Thr Leu Asp Glu Leu Asp Phe He Leu Lys Gly Asn Leu 
1025 1030 1035 1040 

Ser Phe Gly Trp He Ser Gin Thr Leu Gin Lys Lys Val Leu Leu Leu 
. 1045 1050 1055 

Ser Glu Ala Glu He Thr Phe Asn Thr Ser Val Tyr Ser Gin Leu Pro 
1060 1065 1070 
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Gly Gin Glu Ala Phe Leu Arg Ala Gin Val Ser Thr Met Leu Glu Glu 
1075 1080 1085 



Tyr Val Val Tyr Glu Pro Val Phe 
1090 1095 



Leu Met Val Phe Ser Ser Val Gly 
1100 



Gly Leu 
1105 



Leu Leu Leu Ala Leu He 
1110 



Thr Val Ala Leu Tyr Lys Leu Gly 
1115 1120 



Phe Phe 



Lys Arg Gin Tyr Lys Glu 
1125 



Met Leu Asp Leu Pro Ser Ala Asp 
1130 1135 



Pro Asp 



Pro Ala Gly Gin Ala Asp 
1140 



Ser Asn His Glu Thr Pro Pro His 
1145 1150 



Leu Thr 



Ser 
1155 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
AGTTACGGAT CCGGCACCAT GACCTTCGGC ACTGTGATCC TCCTGTGTG 49 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: . DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
GCTGGACGAT GGCATCCAC 19 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTAGAGTTAC GGATCCGGCA CCAT 24 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ' ID NO:50: 
GCAGCCAGCT TCGGACAGAC 2 0 

(2) INFORMATION FOR SEQ ID NO: SI: '". 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

, (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51; 
CCATGTCCAC AGAACAGAGA G 21 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3803 base pairs 
.? (B)- TYPE: nucleic acid 

(C) STRANDEDNESS: single 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

, (A) NAME/KEY: CDS 

(B) LOCATION: 1..3486 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 52 : ' 

ATG GTC CGT GGA GTT GTG ATC CTC CTG TGT GGC TGG GCC CTG GCT TCC 48 
Met Val Arg Gly Val Val He Leu Leu Cys Gly Trp Ala Leu Ala Ser 

1 - 5 ' _ ; 10 . 15 

TGT CAT GGG TCT AAC CTG GAT GTG GAG AAG CCC GTC GTG TTC AAA GAG 96 
Cys His Gly Ser Asn Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu 
20 25 30 
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GAT GCA GCC AGC TTC GGA CAG ACT GTG GTG CAG TTT GGT GGA TCT CCA 
Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg 
35 40 45 

CTC GTG GTG GGA GCC CCT CTG GAG GCG GTG GCA GTC AAC CAA ACA GGA 
Leu Val Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly 
50 55 60 



144 



192 



CAG TCG TCT GAC TGT CCG CCT GCC ACT GGC GTG TGC CAG CCC ATC TTA 240 
Gin Ser Ser Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro lie Leu 
65 70 75 80 

CTG CAC ATT CCC CTA GAG GCA GTG AAC ATG TCC CTG GGC CTG TCT CTG 288 
Leu His lie Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

GTG GCT GAC ACC AAT AAC TCC CAG TTG CTG GCT TGT GGT CCA ACT GCA 336 
Val Ala Asp Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 110 

CAG AG A GCT TGT GCA AAG AAC ATG TAT GCA AAA GGT TCC TGC CTC CTT 384 
Gin Arg Ala Cys Ala Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu- 
115 v - . , .120. 125 

CTG GGC TCC AGC TTG CAG TTC ATC CAG GCA ATC CCT GCT ACC ATG CCA 432 
Leu Gly Ser Ser Leu Gin Phe lie Gin Ala lie Fro Ala Thr Met Pro 

130 135 ... 140 

GAG TGT CCA GGA CAA GAG ATG GAC ATT GCT TTC CTG ATT GAT GGC TCC 480 
Glu Cys Pro Gly Gin Glu Met Asp lie Ala Phe Leu He Asp Gly Ser 
145 150 155 160 

GGC AGC ATT GAT CAA AGT GAC TTT ACC CAG ATG AAG GAC TTC GTC AAA 528 
Gly Ser He Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys 
165 170 175 

GCT TTG ATG GGC CAG TTG GCG AGC ACC AGC ACC TCG -TTC TCC CTG ATG 576 
Ala Leu Met Gly Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met 
180 185 190 

CAA TAC TCA AAC ATC CTG AAG ACT CAT TTT ACC TTC ACG GAA TTC AAG 624 
Gin Tyr Ser Asn He Leu Lys Thr His Phe Thr Pho Thr Glu Phe Lys 
195 200 205 

AGC AGC CTG AGC CCT CAG AGC CTG GTG GAT GCC ATC GTC CAG CTC CAA 672 
Ser Ser Leu Ser Pro Gin Ser Leu Val Asp Ala He Val Gin Leu Gin 
210 215 220 

GGC CTG ACG TAC ACA GCC TCG GGC ATC CAG AAA GTG GTG AAA GAG CTA 720 
Gly Leu Thr Tyr Thr Ala Ser Gly He Gin Lys Val Val Lys Glu Leu 
225 230 235 240 

TTT CAT AGC AAG AAT GGG GCC, CGA AAA AGT GCC AAG AAG ATA CTA ATT 768 
Phe His Ser Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He 
. 245 , 250 255 

GTC ATC ACA GAT GGG CAG AAA TTC AGA GAC CCC CTG GAG TAT AGA CAT 816 
Val He Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg His 

260 ■ .... 265 270 
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GTC ATC CCT GAA GCA GAG AAA GCT. GGG ATC ATT CGC TAT GCT ATA GGG 864 
Val He Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Glv 
275 280 285 

GTG GGA GAT GCC TTC CGG GAA CCC ACT GCC CTA CAG GAG CTG AAC ACC 912 
Val Gly Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Aan Thr 
290 295 300 

ATT GGC TCA GCT CCC TCG CAG GAC CAC GTG TTC AAG GTG GGC AAT TTT 960 
He Gly Ser Ala Pro Ser Gin Asp His Val Phe Lys Val Gly Aan Phe 
305 310 315 320 

GTA GCA CTT CGC AGC ATC CAG CGG CAA ATT CAG GAG AAA ATC TTT GCC 1008 
Val Ala Leu Arg Ser He Gin Arg Gin He Gin Glu Lys He Phe Ala 
325 330 335 

ATT GAA GGA ACC GAA TCA AGG TCA-AGT AGT TCC TTT CAG CAC GAG ATG 1056 
He Glu Gly Thr Glu Ser Arg Ser Ser Ser Scr Phe Gin His Glu Met 
340 345 350 

TCA CAA GAA GGT TTC AGC TCA GCT CTC TCA ATG GAT GGA CCA GTT CTG 1104 
Ser Gin Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu 
355 360 365 

GGG GCT GTC GGA GGC TTC AGC TGG TCT - GGA GCT GCC TTC TTG TAC CCC 1152 
Gly Ala Val Gly Gly Phe Ser Trp' Ser Gly Gly Ala Phe Leu Tyr Pro 
3^0 375 380 

TCA AAT ATG AGA TCC ACC TTC ATC AAC ATG TCT CAG GAG AAC GAG GAT 1200 
Ser Asn Met Arg Ser Thr Phe He Asn Met Ser Gin Glu Asn Glu Aso 
385 390 395 400 

ATG AGG GAC GCT TAC CTG GGT TAC TCC ACC GCA CTG GCC TTT TGG AAG 1248 
Met Arg Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys 
405 410 415 

GGG GTC CAC AGC CTG ATC CTG GGG GCC CCT CGC CAC CAG CAC ACG GGG 1296 
Gly Val His Ser Leu He Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 

AAG GTT GTC ATC TTT ACC CAG GAA TCC AGG CAC TGG AGG CCC AAG TCT 1344 
Lys Val Val lie Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser 
435 440 445 

GAA GTC AGA GGG ACA CAG ATC GGC TCC TAC TTT GGG GCA TCT CTC TGT 1392 
Glu Val Arg Gly Thr Gin He Gly Ser tyr Phe Gly Ala Ser Leu Cys 
450 455 460 

TCT GTG GAC ATG GAT AGA GAT GGC AGC ACT GAC CTG CTC CTG ATT GGA 1440 
Ser Val Asp Met Asp Arg Asp Gly Ser Thr Asp Leu Vai Leu He Glv 
465 470 475 480 

GTC CCC CAT TAC TAT GAG CAC ACC CGA GGG GGG CAG GTG TCG GTG TGC 1488 
Val Pro His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys 
485 490 495 

CCC ATG CCT GGT GTG AGG AGC AGG TGG CAT TGT GGG ACC ACC CTC CAT 1536 
Pro Met Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His 
500 505 510 
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GGG GAG CAG GGC CAT CCT TGG GGC CGC TTT GGG GCG GCT CTG ACA GTG 1584 
Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val 
515 520 525 

CTA GGG GAC GTG AAT GGG GAC AGT CTG GCG GAT GTG GCT ATT GGT GCA 1632 
Leu Gly Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala lie Gly Ala 
530 535 540 

CCC GGA GAG GAG GAG AAC AGA GGT GCT GTC TAC ATA TTT CAT GGA GCC 1680 
Pro Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr lie Phe His Gly Ala 
545 550 555 560 

TCG AGA CAG GAC ATC GCT CCC TCG CCT AGC CAG CGG GTC ACT GGC TCC 1728 
Ser Arg Gin Asp lie Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

CAG CTC TTC CTG AGC CTC CAA TAT TTT GGG CAG TCA TTA AGT GGG GGT 1776 
Gin Leu Phe Leu *\rg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
580 585 590 

CAG GAC CTT ACA CAG GAT GGC CTG GTG GAC CTG GCC GTG GGA GCC CAG 1824 
Gin Asp Leu Thr Gin Asp Gly ' Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 

GGG CAC GTC CTG CTC CTT ;AGG AGT CTG CCT TTG CTG AAA GTG GGG ATC 1872 
Gly His Val Leu Leu Leu *ftrg Ser Leu Pro Leu Leu Lys Val Gly lie 
610 615 620 

TCC ATT ACA TTT GCC CCC TCA GAG GTG GCA AAG ACT GTG TAC CAG TGC 1920 
Ser lie Arg Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys 
625 630 635 640 

TGG GGA AGG ACT CCC ACT GTC CTC GAA GCT GGA GAG GCC ACC GTC TGT 1968 
Trp Gly Arg Thr Pro Thr Vai Leu Glu Ala Gly Glu Ala Thr Val Cys 
645 650 655 

CTC ACT GTC CGC AAA GGT TCA CCT GAC CTG TTA GGT GAT GTC CAA AGC 2016 
Leu Thr Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser 
660 665 670 

TCT GTC AGG TAT GAT CTG GCG TTG GAT CCG GGC CGT CTG ATT TCT CGT 2064 
Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu He Ser Arg 
675 660 685 

GCC ATT TTT GAT GAG ACG AAG AAC TGC ACT TTG ACC CGA AGG AAG ACT 2112 
Ala He Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Arg Arg Lys Thr 
690 695 700 

CTG GGG CTT GGT GAT CAC TGC GAA ACA ATG AAG CTG CTT TTG CCA GAC 2160 
Leu Gly Leu Gly Asp His Cys Glu Thr Met Lys Leu Leu Leu Pro Asp 
70S 710 715 720 

TGT GTG GAG GAT GCA GTG ACC CCT ATC ATC CTG CGC CTT AAC TTA TCC 2208 
Cys Val Glu Asp Ala Val' Thr Pro He He Leu Arg Leu Asn Leu Ser 
725 730 735 

CTG GCA GGG GAC TCT GCT CCA TCC AGG AAC CTT CGT CCT GTG CTG GCT 2256 
Leu Ala Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala 
740 745 750 
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GTG GGC TCA CAA GAC CAT GTA ACA GCT TCT TTC CCG TTT GAG AAG AAC 2304 
Val Gly Ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn 
755 760 765 

TGT AAG CAG GAG CTC CTG TGT GAG GGG AAC CTG GGC GTC AGC TTC AAC 2352 
Cys Lys Gin Glu Leu Leu Cys Glu Gly Asn Leu Gly Val Ser Phe Asn 
770 775 780 

TTC TCA GGC CTG CAG GTC TTG GAG GTA GGA AGC TCC CCA GAG CTC ACT 2400 
Phe Ser Gly Leu Gin Val Leu Glu Val Gly Ser Ser Pro Glu Leu Thr 
785 790 795 800 

GTG ACA GTA ACA GTT TGG AAT GAG GGT GAG GAC AGC TAT GGA ACC TTA 2448 
Val Thr Val Thr Val Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu 
805 810 815 

ATC AAG TTC TAC TAC CCA GCA GAG CTA TCT TAC CGA CGG GTG ACA AGA 2496 
lie Lys Phe Tyr Tyr Pro Ala Glu„Leu Ser Tyr Arg Arg Val Thr Arg 
820 825 830 

GCC CAG CAA CCT CAT CCG TAC CCA CTA CGC CTG GCA TGT GAG GCT GAG 2544 
Ala Gin Gin Pro His Pro Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu 
835 840 845 

CCC ACG GGC CAG GAG AGC CTG AGG AGC AGC AGC TGT AGC ATC AAT CAC 2592 
Pro Thr Gly Gin Glu Ser Leu Arg Ser Ser Ser Cys Ser He Asn His 
850 855 860 

CCC ATC TTC CGA GAA GGT GCC AAG GCC ACC TTC ATG ATC ACA TTT GAT 2640 
Pro He Phe Arg Glu Gly Ala Lys Ala Thr Phe Met He Thr Phe Asp 
565 870 875 880 

GTC TCC TAC AAG GCC TTC CTG GGA GAC AGG TTG CTT CTG AGG GCC AGC 2688 
val Ser Tyr Lys Ala Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Ser 
885 890 895 

GCA AGC AGT GAG AAT AAT AAG CCT GAA ACC AGC AAG ACT GCC TTC CAG 2736 
.Ma Ser Ser Glu Asn Asn Lys Pro Giu Thr Ser Lys Thr Ala Phe Gin 
900 90S 910 

?rc- GAG CTT CCG GTG* AAG TAC ACG GTC TAT ACC GTG ATC AGT AGG CAG 2784 
- u Glu Leu Pro Val Lys Tyr Thr Val Tyr Thr Val He Ser Arg Gin 
915 920 925 

C-AA GAT TCT ACC AAG CAT TTC AAC TTC TCA TCT TCC CAC GGG GAG AGA 2832 
C-lu Asp Ser Thr Lys His Phe Asn Phe Ser Ser Ser His Gly Glu Aro 
930 935 940 

.-AG AAA GAG GCC GAA CAT CGA TAT CGT GTG AAT AAC CTG AGT CCA TTG 2880 
Oin Lys Glu Ala Glu His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu 
'950 955 960 

ACG CTG GCC ATC AGC GTT AAC TTC TGG GTC CCC ATC CTT CTG AAT GGT 2928 
-r.r Leu Ala He Ser Val Asn Phe Trp Val Pro He Leu Leu Asn Gly 
965 970 975 

"~G GCC GTG TGG GAT GTG ACT CTG AGG AGC CCA GCA CAG GGT GTC TCC 2976 
-1 Ala Val Trp Asp Val Thr Leu Arg Ser Pro Ala Gin Gly Val Ser 
980 985 990 
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TGT GTG TCA CAG AGG GAA CCT CCT CAA CAT TCC GAC CTT CTG ACC CAG 3024 
Cys Val Ser Gin Arg Glu Pro Pro Gin His Ser Asp Leu Leu Thr Gin 
995 1000 1005 

ATC CAA GGA CGC TCT GTG CTG GAC TGC GCC ATC GCC GAC TGC CTG CAC 3072 
He Gin Gly Arg Ser Val Leu Asp Cys Ala He Ala Asp Cys Leu His . 
1010 1015 1020 

CTC CGC TGT GAC ATC CCC TCC TTG GGC ACC CTG GAT GAG CTT GAC TTC 3120 
Leu Arg Cys Asp He Pro Ser Leu Gly Thr Leu Asp Glu Leu Asp Phe 
1025 1030 1035 i0 40 

ATT CTG AAG GGC AAC CTC AGC TTC GGC TGG ATC AG T CAG ACA TTG CAG 3168 
He Leu Lys Gly Asn Leu Ser Phe Gly Trp lie Ser Gin Thr Leu Gin 
1045 1050 1055 

AAA AAG GTG TTG CTC CTG AGT GAG GCT GAA ATC ACA TTC AAC ACA TCT 3216 
Lys Lys Val Leu Leu Leu Ser Glu Ala Glu He Thr Phe Asn Thr Ser 
1060 1065 1070 

GTG TAT TCC CAG CTG CCG GGA CAG GAG GCA TTT CTG AGA GCC CAG GTG 3264 
Val Tyr Ser Gin Leu Pro Gly , Gin Glu Ala Phe Leu Arg Ala Gin Val 
1075 1080 1085 

TCA ACG ATG CTA GAA GAA TAC GTG GTC TAT GAG CCC GTC TTC CTC ATG 3312 
Ser Thr Met Leu Glu Glu tyr Val Val Tyr Glu Pro Val Phe Leu Met 
1090 1095 HOO 

GTG TTC AGC TCA GTG GGA GGT CTG CTG TTA CTG GCT CTC ATC ACT GTG 3360 

Yfi e Phe Ser Ser Val Sly Leu Leu Leu Leu Ala Leu He Thr Val 
1105 mo ins 112Q 

GCG CTG TAC AAG CTT GGC TTC TTC AAA CGT CAG TAT AAA GAG ATG CTG 3408 
Ala Leu Tyr Lys Leu Gly Phe Phe Lys Arg Gin Tyr Lys Glu Met Leu 
1125 H30 H35 

GAT CTA CCA TCT GCA GAT CCT GAC CCA GCC GGC CAG GCA GAT TCC AAC 3456 
Asp Leu Pro Ser Ala Asp Pro Asp Pro Ala Gly Gin Ala Asp Ser Asn 
1140 H45 H50 

CAT GAG ACT CCT CCA CAT CTC ACG TCC TAGGAATCTA CTTTCCTGTA 3S03 
His Glu Thr Pro Pro His Levi Thr Ser 
1155 - iiso 

TATCTCCACA ATTACGAGAT TGGTTTTGCT TTTGCCTATG AATCTACTGG CATGGGAACA 3563 

AGTTCTCTTC AGCTCTGGGC TAGCCTGGGA AACTTCCCAG AAATGATGCC CTACCTCCTG 3623 

AGCTGGGAGA TTTTTATGGT TTGCCCATGT GTCAGATTTC AGTGCTGATC CACTTTTTTT 3683 

GCAAGAGCAG GAATGGGGTC AGCATAAATT TACATATGGA TAAGAACTAA CACAAGACTG 3743 

AGTAATATGC TCAATATTCA ATGTATTGCT TGTATAAATX TTTAAAAAAT AAAATGAAAN 3803 



(2) INFORMATION FOR SEQ* ID NO: 53: . 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1161 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Val Arg Gly Val Val He Leu Leu Cys Gly Trp Ala Leu Ala Ser 
15 10 15 

Cys His Gly Ser Aen Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu 
20 25 30 

Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg 
35 40 45 

Leu Val Val Cly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly 
50 55 60 

Gin Ser Ser Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro He Leu 
65 70 75 80 

Leu His He Pro Lou Clu Ala Val Aen Met Ser Leu Gly Leu Ser Leu 
85 90 ,95 

Val Ala Asp Thr Asn San Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 HO 

Gin Arg Ala Cys Ala Ly3 Asn Met Tyr Ala Ly3 Gly Ser Cys Leu Leu 
115 120 125 

Leu Gly Ser Ser Leu Gin Phe lie Gin Ala He Pro Ala Thr Met Pro 
130 135 140 

Glu Cys Pro Gly Gin Glu Met Asp He Ala Phe Leu He Asp Gly Ser 
145 150 155 160 

Gly Ser He Asp Gin Ser Asu Phe Thr Gin Met Lys Asp Phe Val Lys 
165 170 175 

Ala Leu Met. Gly Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met 
180 185 190 

Gin Tyr Ser Asn He Leu Lys Thr Hist Phe Thr Phe Thr Glu Phe Lye 
195 200 205 

Ser Ser Leu Ser Pro Gin Ser Leu Val Asp Ala He Val Gin Leu Gin 
210 215 220 

Gly Leu Thr Tyr Thr Ala Ser Gly He Gin Lys Val Val Lys Glu Leu 
225 230 235 240 

Phe His Ser Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He 
245 250 255 

Val He Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg His 
260 v 265 270 

Val He Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Gly 
275 280 265 



8NS0OCID: <WO_9517412A1_l_> 



WO 95/17412 PCT/US94/14832 



- 135 - 

Val Gly Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr 
290 295 300 

lie Gly Ser Ala Pro Ser Gin Asp His Val Phe Lys Val Gly Asn Phe 
305 310 315 320 

Val Ala Leu Arg Ser He Gin Arg Gin lie Gin Glu Lys He Phe Ala 
325 330 335 

He Glu Gly Thr Glu ser Arg Ser Ser Ser ser Phe Gin His Glu Met 
340 345 350 

Ser Gin Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu 
355 360 365 

Gly Ala Val Gly Gly Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro 
370 375 380 

Ser Asn Met Arg Ser Thr Phe lie Asn Met Ser Gin Glu Asn Glu Asp 
385 390 395 400 

Met Arg Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys 
405 410 415 

Gly Val His Ser Leu He Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 

Lys Val Val He Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser 
435 440 445 

Glu Val Arg Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys 
450 455 460 

Ser VaL Asp Met Asp Arg Asp Gly Ser Thr Asp Leu Val Leu He Gly 
465 470 475 480 

Val Pro His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys 
485 490 495 

Pro Met Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His 
500 505 510 

Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val 
515 520 525 

Leu Gly Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala He Gly Ala 
530 535 540 

Pro Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr lie Phe His Gly Ala 
545 550 555 560 

Ser Arg Gin Asp He Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

Gin Leu Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
580 585 590 

Gin Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 
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Gly His Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Gly lie 
610 615 620 

Ser He Arg Phe Ala Pro Ser Glu Val Ala Lye Thr Val Tyr Gin Cvs 
625 630 635 640 

Trp Gly Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cye 
645 650 655 

Leu Thr Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser 
660 665 670 

Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu He Ser Ara 
675 680 685 

Ala He Phe Asp Glu Thr Lys Ash Cya Thr Leu Thr Arg Arg Lys Thr 
690 695 700 

Leu Gly Leu Gly Asp His Cys Glu Thr Met Lys Leu Leu Leu Pro Asp 
705 710 715 720 

Cys Val Glu Asp Ala Val Thr Pro. lie He Leu Arg Leu Asn Leu Ser 
725 730 735 

Leu Ala Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala 
740 745 750 

Val Gly ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn 
755 760 : 765 

Cys Lye Gin Glu Leu Leu Cys Glu Gly Asn Leu Gly Val Ser Phe Asn 
770 775 780 

Phe Ser Gly Leu Gin Val Leu Glu Val Gly Ser Ser Pro Glu Leu Thr 
785 790 795 800 

Val Thr Val Thr. Val Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu 
80S 810 815 

He Lye Phe Tyr Tyr Pro Ala Glu Leu Ser Tyr Arg Arg Val Thr Arg 
820 825 830 

Ala Gin Gin Pro His Pro Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu 
835 . 840 845 

Pro Thr Gly Gin Glu Ser Leu Arg Ser Ser Ser Cys Ser He Asn His 
850 855 860 

Pro He Phe Arg Glu Gly Ala Lys Ala Thr Phe Met He Thr Phe Asp 
865 870 875 880 

Val Ser Tyr Lys Ala Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Ser 
885 890 895 

Ala Ser Ser Glu Asn Asn Lys Pro Glu Tfcr Ser Lys Thr Ala Phe Gin 
900 90S 910 

Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr Thr Val He Ser Arg Gin 
915 920 925 
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Glu Asp Ser Thr Lys His Phe Asn Phe Ser Ser Ser His Gly Glu Arg 
930 935 940 

Gin Lys Glu Ala Glu His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu 
945 950 955 960 

Thr Leu Ala lie Ser Val Asn Phe Trp Val Pro lie Leu Leu Asn Gly 
965 970 975 

Val Ala Val Trp Asp Val Thr Leu Arg Ser Pro Ala Gin Gly Val Ser 
980 985 990 

Cys Val Ser Gin Arg Glu Pro Pro Gin His Ser Asp Leu Leu Thr Gin 
995 1000 1005 

lie Gin Gly Arg Ser Val Leu Asp Cys Ala He Ala Asp Cys Leu His 
1010 1015 1020 

Leu Arg Cys Asp He Pro Ser Leu Gly Thr Leu Asp Glu Leu Asp Phe 
1025 1030 1035 " 1040 

He Leu Lys Gly Asn Leu Sec Phe Gly Trp He Ser Gin Thr Leu Gin 
1045 1050 1055 

Lys Lys Val Leu Leu Leu Ser Glu Ala Glu lie. Thr Phe Asn Thr Ser 
1060 1065 1070 

Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val 
1075 1080 1085 

Ser Thr Met Leu Glu Glu Tyr Val Val Tyr Glu Pro Val Phe Leu Met 
1090 1095 HOO 

Val Phe Ser Ser Val Gly Gly Leu Leu Leu Leu Ala Leu He Thr Val 
1105 1110 1115 H20 

Ala Leu Tyr Lys Leu Gly Phe Phe Lys Arg Gin Tyr Lys Glu Met Leu 
1125 1130 H35 

Asp Leu Pro Ser Ala Asp Pro Asp Pro Ala Gly Gin Ala Asp Ser Asn 
1140 1145 1150 

His Glu Thr Pro Pro His Leu Thr Ser 
1155 1160 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3597 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 40. ,3525 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

AGCTTTACAG CTCTCTACTT CTCAGTGCAC TGCTCAGTG ATG GCC GGT GGA GTT 54 

Met Ala Gly Gly Val 
1 5 

GTG ATC CTC CTG TGT GGC TGG GTC CTG GCT TCC TGT CAT GGG TCT AAC 102 
Val He Leu Leu Cys Gly Trp Val Leu Ala Ser Cys His Gly Ser Asn 
10 15 20 

CTG GAT GTG GAG GAA CCC ATC GTG TTC AGA GAG GAT GCA GCC AGC TTT 150 
Leu Asp Val Glu Glu Pro He Val Phe Arg Glu Asp Ala Ala Ser Phe 
25 30 35 

GGA CAG ACT GTG GTG CAG TTT GGT GGA TCT CGA CTC GTG GTG GGA GCC 198 
Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg Leu Val Val Gly Ala 
40 45 so 

CCT CTG GAG GCG GTG GCA GTC AAC CAA ACA GGA CGG TTG TAT GAC TGT 246 
Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly Arg Leu Tyr Asp Cys 
55 60 65 

GCA CCT GCC ACT GGC ATG TGC CAG CCC ATC GTA CTG CGC AGT CCC CTA 294 
Ala Pro Ala Thr Gly Met Cys Gin Pro He Val Leu Arg Ser Pro Leu 
70 75 80 85 

GAG GCA GTG AAC ATG TCC CTG GGC CTG TCT CTG GTG ACT GCC ACC AAT 342 
Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Val Thr Ala Thr Asn 
90 ; 95 ioo 

AAC GCC CAG TTG CTG GCT TGT GGT CCA ACT GCA CAG AGA GCT TGT GTG 390 
Asn Ala Gin Leu Leu Ala Cys Gly Pro Thr Ala Gin Arg Ala Cys Val 
105 HO us 

AAG AAC ATG TAT GCG AAA GGT TCC TGC CTC CTT CTC GGC TCC AGC TTG 438 
Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu Leu Gly ser Ser Leu 
120 125 130 

CAG TTG ATC CAG GCA GTC CCT GCC TCC ATG CCA GAG TGT CCA AGA CAA 486 
Gin Phe He Gin Ala Val Pro Ala Ser Met Pro Glu Cys Pro Arg Gin 
135 140 : 145 

GAG ATG GAC ATT GCT TTC CTG ATT GAT GGT TCT GGC AGC ATT AAC CAA 534 
Glu Met Asp He Ala Phe Leu He Asp Gly Ser Gly Ser lie Asn Gin 
150 155 160 165 

AGG GAC TTT GCC CAG ATG AAG GAC TTT GTC AAA GCT TTG ATG GGA GAG 582 
Arg Asp Phe Ala Gin Met Lys Asp Phe Val Lys Ala Leu Met Gly Glu 
170 175 180 

TTT GCG AGC ACC AGC ACC TTG TTC TCC CTG ATG CAA TAC TCG AAC ATC 630 
Phe Ala Ser Thr Ser Thr Leu Phe Ser Leu Met Gin Tyr Ser Asn He 
135 190 195 

CTG AAG ACC CAT TTT ACC TTC ACT GAA TTC AAG AAC ATC CTG GAC CCT 678 
Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys Asn He Leu Asp Pro 
200 .205 210 

CAG AGC CTG GTG GAT CCC ATT GTC CAG CTG CAA GGC CTG ACC TAC ACA 726 
Gin Ser Leu Val Asp Pro He Val Gin Leu Gin Gly Leu Thr Tyr Thr 
215 220 225 
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GCC ACA GGC ATC CGG ACA GTG ATG GAA GAG CTA TTT CAT AGO AAG AAT 774 
Ala Thr Gly lie Arg Thr Val Met Glu Glu Leu Phe His Ser Lys Asn 
230 235 240 245 

GGG TCC CGT AAA AGT GCC AAG AAG ATC CTC CTT GTC ATC ACA GAT GGG 822 
Gly Ser Arg Lya Ser Ala Lys Lys He Leu Leu Val He Thr Asp Gly 
250 255 260 

CAG AAA TAC AG A GAC CCC CTG GAG TAT AGT GAT GTC ATT CCC GCC GCA 870 
Gin Lys Tyr Arg Asp Pro Leu Glu Tyr Ser Asp Val He Pro Ala Ala 
265 270 275 

GAC AAA GCT GGC ATC ATT CGT TAT GCT ATT GGG GTG GGA GAT GCC TTC 918 
Asp Lys Ala Gly He He Arg Tyr Ala He Gly Val Gly Asp Ala Phe 
280 285 290 

CAG GAG CCC ACT GCC CTG AAG GAG CTG AAC ACC ATT GGC TCA GCT CCC 966 
Gin Glu Pro Thr Ala Leu Lys Glu Leu Asn Thr He Gly Ser Ala Pro 
295 300 305 

CCA CAG GAC CAC GTG TTC AAG GTA GGC AAC TTT GCA GCA CTT CGC AGC 1014 
Pro Gin Asp His Val Phe Lys Val Gly Asn Phe Ala Ala Leu Arg Ser 
310 315 320 325 

ATC CAG AGG CAA CTT CAG GAG AAA ATC TTC GCC ATT GAG GGA ACT CAA 1062 
He Gin Arg Gin Leu Gin Glu Lys He Phe Ala He Glu Gly Thr Gin 

330 . ■ « 335 340 

TCA AGG TCA AGT AGT TCC TTT CAG CAC GAG ATG TCA CAA GAA GGT TTC 1110 
Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met Ser Gin Glu Gly Phe 
345 , 350 355 

AGT TCA GCT CTC ACA TCG GAT GGA CCC GTT CTG GGG GCC GTG GGA AGC 1158 
Ser Ser Ala Leu Thr Ser Asp Gly Pro Val Leu Gly Ala Val Gly Ser 
360 365 370 

TTC AGC TGG TCC GGA GGT GCC TTC TTA TAT CCC CCA AAT ACG AGA CCC 1206 
Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro Asn Thr Arg Pro 

375 . ~ 380 385 

ACC TTT ATC AAC ATG TCT CAG GAG AAT GTG GAC ATG AGA GAC TCC TAC 1254 
Thr Phe He Asn Met Ser Gin Glu Asn Val Asp Met Arg Asp Ser Tyr 
390 395. 400 405 

CTG GGT TAC TCC ACC GCA GTG GCC TTT TGG AAG GGG GTT CAC AGC CTG 1302 
Leu Gly Tyr Ser Thr Ala Val Ala Phe Trp Lys Gly Val His Ser Leu 
410 415 . 420 

ATC CTG GGG GCC CCG CGT CAC CAG CAC ACG GGG AAG GTT GTC ATC TTT 1350 
He Leu Gly Ala Pro Arg His Gin His Thr Gly Lys Val Val He Phe 
425 430 435 

ACC CAG GAA GCC AGG CAT TGG AGG CCC AAG TCT GAA GTC AGA GGG ACA 1398 
Thr Gin Glu Ala Arg His Trp Arg Pro Lys Ser Glu Val Arg Gly Thr 
440 445 450 

CAG ATC GGC TCC TAC TTC GGG GCC TCT CTC TGT TCT GTG GAC GTG GAT 1446 
Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val Asp Val Asp 
455 , 460, 465 
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AGA GAT GGC AGC ACY GAC CTG GTC CTG ATC GGA GCC CCC CAT TAC TAT 1494 
Arg Asp Gly Ser Xaa Asp Leu Val Leu He Gly Ala Pro His Tyr Tyr 
470 475 480 485 

GAG CAG ACC CGA GGG GGG CAG GTC TCA GTG TTC CCC GTG CCC GGT GTG 1542 
Glu Gin Thr Arg Gly Gly Gin Val Ser Val Phe Pro Val Pro Gly Val 
490 495 500 

AGG GGC AGG TGG CAG TGT GAG GCC ACC CTC CAC GGG GAG CAG GGC CAT 1590 
Arg Gly Arg Trp Gin Cys Glu Ala Thr Leu His Gly Glu Gin Gly His 
505 510 515 

CCT TGG GGC CGC TTT GGG GTG GCT CTG ACA GTG CTG GGG GAC GTA AAC 1638 
Pro Trp Gly Arg Phe Gly Val Ala Leu Thr Val Leu Gly Asp Val Asn 
520 525 530 

GGG GAC AAT CTG GCA GAC GTG GCT ATT GGT GCC CCT GGA GAG GAG GAG 1686 
Gly Asp Asn Leu Ala Asp Val Ala He Gly Ala Pro Gly Glu Glu Glu 
535 540 545 

AGC AGA GGT OCT GTC TAC ATA TTT CAT GGA GCC TCG AGA CTG GAG ATC 1734 
Ser Arg Gly Ala Val Tyr lie Phe Kis Gly Ala Ser Arg Leu Glu He 
550 555 * 560 565 

ATG CCC TCA CCC AGC CAG CGG GTC ACT GGC TCC CAG CTC TCC CTG AGA 1782 
Met Pro Ser Pro Ser Gin Arg Val Thr Gly Ser Gin Leu Ser Leu Arg 
570 575 580 

CTG CAG TAT TTT GGG CAG TCA TTG AGT GGG GGT CAG GAC CTT ACA CAG 1830 
Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly Gin Asp Leu Thr Gin 
565 590 595 

GAT GGC CTG GTG GAC CTG GCC GTG GGA GCC CAG GGG CAC GTA CTG CTG 1878 
Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin Gly His Val Leu Leu 
600 605 610 

CTC AGG AGT CTG CCT CTG CTG AAA CTG GAG CTC TCC ATA AGA TTC GCC 1926 
Leu Arg Ser Leu Pro Leu Leu Lys Val Glu Leu Ser lie Arg Phe Ala 
615 620 625 

CCC ATG GAG GTG GCA AAG GCT GTG TAC CAG TGC TGG GAA AGG ACT CCC 1974 
Pro Met Glu Val Ala Lys Ala Val Tyr Gin Cys Trp Glu Arg Thr Pro 
630 635 640 645 

ACT GTC CTC GAA GCT GGA GAG GCC ACT GTC TGT CTC ACT GTC CAC AAA 2022 
Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys Leu Thr Val His Lys 
650 655 660 

GGC TCA CCT GAC CTG TTA GGT AAT GTC CAA GGC TCT GTC AGG TAT GAT 2070 
Gly Ser Pro Asp Leu Leu Gly Asn Val Gin Gly Ser Val Arg Tyr Asp 
665 670 675 

CTG GCG TTA GAT CCG GCC CGC CTC ATT TCT CGT CCC ATT TTT GAT GAG 2118 
Leu Ala Leu Asp Pro Gly Arg Leu He Ser Arg Ala lie Phe Asp Glu 
680 685 690 

ACT AAG AAC TGC ACT TTG ACG GGA AGG AAG ACT CTG GGG CTT GGT GAT 2166 
Thr Lys Asn Cys Thr Leu Thr Gly Arg Lys Thr Leu Gly Leu Gly Asp 
695 700 705 
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CAC TGC GAA ACA GTG AAG CTG CTT TTG CCG GAC TGT GTG GAA GAT GCA 2214 
His Cys Glu Thr Val Lys Leu Leu Leu Pro Asp Cys Val Glu Asp Ala 
710 715 720 725 

GTG AGC CCT ATC ATC CTG CGC CTC AAC TTT TCC CTG GTG AGA GAC TCT 2262 
Val Ser Pro He He Leu Arg Leu Asn Phe Ser Leu Val Arg Asp Ser 
730 735 740 

GCT TCA CCC AGG AAC CTG CAT CCT GTG CTG GCT GTG GGC TCA CAA GAC 2310 
Ala Ser Pro Arg Asn Leu His Pro Val Leu Ala Val Gly Ser Gin Asp 
745 750 755 

CAC ATA ACT GCT TCT CTG CCG TTT GAG AAG AAC TGT AAG CAA GAA CTC 2358 
His He Thr Ala Ser Leu Pro Phe Glu Lys Asn Cys Lys Gin Glu Leu 
760 765 770 

CTG TGT GAG GGG GAC CTG GGC ATC AGC TTT AAC TTC TCA GGC CTG CAG 2406 
Leu Cys Glu Gly Asp Leu Gly He Ser Phe Asn Phe Ser Gly Leu Gin 
775 780 785 

GTC TTG GTG GTG GGA GGC TCC CCA GAG CTC ACT GTG ACA GTC ACT GTG 2454 
Val Leu Val Val Gly Gly Ser Pro Glu Leu Thr Val Thr Val Thr Val 
7 90 795 800 805 

TGG AAT GAG GGT GAG GAC AGC TAT GGA ACT TTA GTC AAG TTC TAG TAC 2502 
Trp Asn Glu Gly Olu Asp Ser Tyr Gly Thr Leu Val Lys Phe Tyr Tyr 
810 815 820 

CCA GCA GGG CTA TCT TAC CGA CGG GTA ACA GGG ACT CAG CAA CCT CAT 2550 
Pro Ala Gly Leu Ser Tyr Arg Arg Val Thr Gly Thr Gin Gin Pro His 
825 830 835 

CAG TAC CCA CTA CGC TTG CCC TGT GAG GCT GAG CCC GCT GCC CAG GAG 2598 
Gin Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu Pro Ala Ala Gin Glu 
840 845 850 

GAC CTG AGG AGC AGC AGC TGT AGC ATT AAT CAC CCC ATC TTC CCA GAA 2646 
Asp Leu Arg Ser Ser Ser Cys Ser He Asn His Pro He Phe Arc Glu 
855 860 865 

GGT GCA AAG ACC ACC TTC ATG ATC ACA TTC GAT GTC TCC TAC AAG GCC 2694 
Gly Ala Lys Thr Thr Phe Met He Thr Phe Asp Val Ser Tyr Lys Ala 
870 875 .880 885 

TTC CTA GGA GAC AGG TTG CTT CTG AGG GCC AAA GCC AGC AGT GAG AAT 2742 
Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Lys . Ala Ser Ser Glu Asn 
890 895 900 

AAT AAG CCT GAT ACC AAC AAG ACT GCC TTC CAG CTG GAG CTC CCA GTG 2790 
Asn Lys Pro Asp Thr Asn Lys Thr Ala Phe Gin Leu Glu Leu Pro Val 
905 910 915 

AAG TAC ACC GTC TAT ACC CTG ATC AGT AGG CAA GAA GAT TGC ACC AAC 2838 
Lys Tyr Thr Val Tyr Thr Leu He Ser Arg Gin Glu Asp Ser Thr Asn 
920 925 930 

CAT GTC AAC TTT TCA TCT TCC CAC GGG GGG AGA AGG CAA GAA GCC GCA 2886 
His Val Aen Phe Ser Ser Ser Hie Gly Gly Arg Arg Gin Glu Ala Ala 
935 940 945 
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CAT CGC TAT CGT GTG AAT AAC CTG AGT CCA CTG AAG CTG GCC GTC AGA 
Hie Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu Lys Leu Ala Val Arg 
950 955 960 965 

GTT AAC TTC TGG GTC CCT GTC CTT CTG AAC GGT GTG GCT GTG TGG GAC 
Val Asn Phe Trp Val Pro Val Leu Leu Asn Gly Val Ala Val Trp Asp 
970 975 980 

GTG ACT CTG AGC AGC CCA GCA CAG GGT GTC TCC TGC GTG TCC CAG ATG 
Val Thr Leu ser Ser Pro Ala Gin Gly Val Ser Cys Val Ser Gin Met 
985 990 995 



2934 



2982 



3030 



AAA CCT CCT CAG AAT CCC GAC TTT CTG ACC CAG ATT CAG AGA CGT TCT 
Lys Pro Pro Gin Asn Pro Asp Phe Leu Thr Gin lie Gin Arg Arg Ser 
1000 1005 1010 



3078 



GTG CTG GAC TGC TCC ATT GCT GAC TGC CTG CAC TTC CGC TGT GAC ATC 
Val Leu Asp Cys Ser He Ala Asp Cys Leu His Phe Arg Cys Asp lie 
1015 1020 1025 

CCC TCC TTG GAC ATC CAG GAT GAA CTT GAC TTC ATT CTG AGG GGC AAC 
Pro Ser Leu Asp lie Gin Asp Glu Leu Asp Phe He Leu Arg Gly Aen 
1030 1035 1040 1045 

CTC AGC TTC GGC TGG GTC AGT CAG ACA TTG CAG GAA AAG GTG TTG CTT 
Leu ser Phe Gly Trp Val ser Gin Thr Leu Gin Glu Lys val Leu Leu 
1050 1055 1060 

GTG AGT GAG GCT GAA ATC ACT TTC GAC ACA TCT GTG TAC TCC CAG CTG 
Val Ser Glu Ala Glu He Thr Phe Asp Thr Ser Val Tyr Ser Gin Leu 
1065 1070 1075 

CCA GGA CAG GAG GCA TTT CTG AGA GCC CAG GTG GAG ACA ACG TTA GAA 
Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val Glu Thr Thr Leu Glu 
1080 1085 1090 . 

GAA TAC GTG GTC TAT GAG CCC ATC TTC CTC GTG GCG GGC AGC TCG GTG 
Glu Tyr Val Val Tyr Glu Pro He Phe Leu Val Ala Gly Ser Ser Val 
1095 HOO . H05 



GGA GGT CTG CTG TTA CTG GCT CTC ATC ACA GTG GTA CTG TAC AAG CTT 
Gly Gly Leu Leu Leu Leu Ala Leu He Thr Val Val Leu Tyr Lys Leu 
1110 1115 1120 H25 



GGC TTC TYC AAA CGT CAG TAC AAA GAA ATG CTG GAC GGC AAG GCT GCA 
Gly Phe Xaa Lys Arg Gin Tyr Lys Glu Met Leu Asp Gly Lys Ala Ala 
1130 1135 1140 

GAT CCT GTC ACA GCC GGC CAG GCA GAT TTC GGC TGT GAG ACT CCT CCA 
Asp Pro Val Thr Ala Gly Gin Ala Asp Phe Gly Cys Glu Thr Pro Pro 
1145 1150 H55 

TAT CTC GTG AGC TAGGAATCCA CTCTCCTGCC TATCTCTGCA ATGAAGATTG 
Tyr Leu Val Ser 
1160 



3126 



3174 



3222 



3270 



3318 



3366 



3414 



3462 



3510 



3562 



GTCCTGCCTA TGAGTCTACT GGCATGGGAA CGAGT 



3597 
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' (2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 : 

Met Ala Cly Gly Val Val He Leu Leu Cya Gly Trp Val Leu Ala Ser 
1 5 10 is 

Cys His Gly Ser Asn Leu Asp Val Glu Glu Pro He Val Phe Arg Glu 
20 25 30 

Asp Ala Ala Ser Phe Cly Gin Thr Val Val Gin Phe Gly Gly Ser Arc 
35 40 4S 

Leu val Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Glv 

50 55 ; ' 60 

Arg Leu Tyr Asp Cys Ala Pro Ala Thr Gly Met Cys Gin Pro He Val 
65 70 75 80 

Leu Arg Ser Pro L#u Clu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

Val Thr Ala Thr Asn Asn Ala Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 no 

Gin Arg Ala Cys Val Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu 
115 120 125 

Leu Gly Ser Ser Leu Gin Phe He Gin Ala Val Pro Ala S«r Met Pro 

'130 135 ~ 140 

Glu Cys Pro Arg Gin Glu Met Asp lie Ala Phe Leu He Asp Glv Ser 
145 150 . . . 155 160 

Gly Ser He Asn Gin Arg Asp Phe Ala Gin Met Lys Asp Phe Val Lys 
165 170 175 

Ala Leu Met Gly clu Fhe Ala Ser Thr Ser thr Leu Phe Ser Leu Met 
180 185 190 

Gin Tyr Ser Asn He Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lvs 

155 - 200 205 : * 

Asn He Leu Asp Pro Gin Ser Leu Val Asp Pro He Val Gin Leu Gin 
210 215 . „ 220 

Gly Leu Thr Tyr Thr Ala Thr Gly He Arg Thr Val Met Glu Glu Leu 
225 230 235 240 

Phe His Ser Lys Asn Gly Ser Arg Lys Ser Ala Lys Lys lie Leu Leu 
245 250 255 

Val He Thr Asp Gly Gin Lys Tyr Arg Asp Pro Leu Glu Tyr Ser Asp 
260 265 270 
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Val lie Pro Ala Ala Asp Lys Ala Gly He lie Arg Tyr Ala He Cly 
275 280 285 

Val Gly Asp Ala Phe Gin Glu Pro Thr Ala Leu Lys Glu Leu Asn Thr 
290 295 300 

He Gly Ser Ala Pro Pro Gin Asp His Val Phe Lys Val Gly Asn Phe 
305 310 315 320 

Ala Ala Leu Arg Ser He Gin Arg Gin Leu Gin Glu Lys He Phe Ala 
325 330 335 

He Glu Gly Thr Gin Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met 
340 345 350 

Ser Gin Glu Gly Phe Ser Ser Ala Leu Thr Ser Asp Gly Pro Val Leu 
355 360 365 

Gly Ala Val Gly Sar Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro 
370 375 380 

Pro Asn Thr Arg Pro Thr Phe He Asn Met Ser Gin Glu Asn Val Asp 
385 390 395 400 

Met Arg Asp Ser Tyr Leu Gly Tyr Ser Thr Ala Val Ala Phe Trp Lys 
405 410 415 

Gly Val His Ser Leu He Lau Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 

Lys Val Val He Phe Thr Gin Glu Ala Arg His Trp Arg Pro Lye Ser 
435 440 445 

Glu Val Arg Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys 
450 455 460 

Ser Val Asp Val Asp Arg Asp Gly Ser Xaa Asp Leu Val Leu He Gly 
4fi 5 470 475 480 

Ala Prp Hio Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Phe 
485 490 495 

Pro Val Pro Gly Val Arg Gly Arg Trp. Gin Cys Glu Ala Thr Leu His 
500 505 510 

Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Val Ala Leu Thr Val 
515 520 525 

Leu Gly Asp Val Asn Gly Asp Asn Leu Ala Asp Val Ala He Gly Ala 
530 S35 540 

Pro Gly Glu Glu Glu Ser Arg Gly . Ala Val Tyr He Phe His Gly Ala 
545 550 .555 560 

Ser Arg Leu Glu lie -Met Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

Gin Leu. Ser Leu Arg Leu Gin . Tyr Phe Gly Gin Ser Leu Ser Gly Gly 



580 



585 



590 
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Gin Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 

Gly His Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Glu Leu 
610 615 620 

Ser lie Arg Phe Ala Pro Met Glu Val Ala Lys Ala Val Tyr Gin Cys 
625 630 635 640 

Trp Glu Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr val Cys 
645 650 655 

Leu Thr Val His Lys Gly Ser Pro Asp Leu Leu Gly Asn Val Gin Gly 
660 665 670 

Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu lie Ser Arg 
675 680 685 

Ala He Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Gly Arg Lys Thr 
690 695 700 

Leu Gly Leu Gly Asp His Cys Glu Thr Val Lys Leu Leu Leu Pro Asp 
705 710 715 720 

Cys Val Glu Asp Ala Val Ser Pro He lie Leu Arg Leu Asn Phe Ser 
725 730 735 

Leu Val Arg Asp Ser Ala Ser Pro Arg Asn Leu His Pro Val Leu Ala 
740 745 750 

Val Gly Ser Gin Asp His He Thr Ala Ser Leu Pro Phe Glu Lys Asn 
755 760 765 

Cys Lys Gin Glu Leu Leu Cys Glu Gly Asp Leu Gly He Ser Phe Asn 
770 775 780 

Phe Ser Gly Leu Gin Val Leu Val Val Gly Gly Ser Pro Glu Leu Thr 
785 790 795 800 

Val Thr Val Thr Val Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu 
805 810 815 

Val Lys Phe Tyr Tyr Pro Ala Gly Leu Ser Tyr Arg Arg Val Thr Gly 
820 825 830 

Thr Gin Gin Pro His Gin Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu 
835 640 845 

Pro Ala Ala Gin Glu Asp Leu; Arg Ser Ser Ser Cys Ser lie Asn His 
850 855 860 

Pro He Phe Arg Glu Gly Ala Lys Thr Thr Phe Met He Thr Phe Asp 
865 870 875 880 

Val Ser Tyr Lys Ala Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Lys 
885 890 895 

Ala. Ser Ser Glu Asn Asn Lys Pro Asp Thr Asn Lys Thr Ala Phe Gin 
900 905 910 
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Leu Glu Leu Pro Val Lya Tyr Thr Val Tyr Thr Leu lie Ser Arg Gin 
915 920 925 

Glu Asp Ser Thr Asn His Val Aen Phe Ser Ser Ser His Gly Glv Aro 
930 935 940 

Arg Gin Glu Ala Ala His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu 
945 950 955 96 0 

Lys Leu Ala Val Arg Val Asn Phe Trp Val Pro Val Leu Leu Asn GlV 
965 970 975 

Val Ala Val Trp Asp Val Thr Leu Ser Ser Pro Ala Gin Gly Val Ser 
980 985 990 

Cys Val Ser Gin Met Lye Pro Pro Gin Asn Pro Asp Phe Leu Thr Gin 
995 1000 1005 

lie Gin Arg Arg Ser Val Leu Asp Cys Ser lie Ala Asp Cys Leu His 
1010 1015 1020 

Phe Arg Cys Asp He Pro ser Leu Asp He Gin Asp Glu Leu Asp Phe 
1025 1030 1035 1040 

He Leu Arg Gly Asn Leu Ser Phe Gly Trp Val Ser Gin Thr Leu Gin 
1045 ; 1050 1055 

Glu Lys Val Lou Leu Val Ser Glu Ala Glu He Thr Phe Asp Thr Ser 
1060 1065 . 1070 

Val Tyr Ser Cln Leu Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val 
1075 1080 1085 

Glu Thr Thr Leu Clu Glu Tyr Val Val Tyr Glu Pro lie Phe Leu Val 
1090 1095 HOO 

Ala Gly Ser Ser Val Gly Gly Leu Leu Leu Leu Ala Leu He Thr Val 
1105 mo ins 1120 

Val Leu Tyr Lys Leu Gly Xaa Xaa Lys Arg Gin Tvr Lys Glu Met Leu 
1125 H30 H35 

Asp Gly Lys Ala Ala Asp Pro -Val Thr Xaa Gly Gin Ala Asp Phe Gly 
1140 H45 iiso 

Cys Glu Thr Pro Pro Tyr Leu Val Ser 
1155 H60 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear '•" 

(ii) MOLECULE TYPE : DNA 
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(xij SEQUENCE DESCRIPTION: SEQ ID NO:56: 



CCTGTCATGG GTCTAACCTG 



20 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
AGCTTACACC CATGACAGG 19 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: , linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GGCCTTGCAG CTGGACAATG 20 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : * 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CCAAAGCTGG CTGCATCCTC TC 22 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CCGCCTGCCA CTGGCGTGTG C 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CCCAGATGAA GGACTTCGTC AA 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid . --■ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GCTGGGATCA TTCGCTATGC 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CAATGGATGG ACCAGTTCTG G 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single- 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 



CAGATCGGCT CCTACTTTGG 



20 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : Single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CATGGAGCCT CGAGACAGG 19 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CCACTGTCCT CGAAGCTGGA G 21 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH : 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CTTCGTCCTG TGCTGGCTGT GGGCTC 26 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bass pairs 

(B) TYPE: nucleic acid' 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CCCCTGGCAT GTGAGGCTGA G 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: is ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CCGTCATCAG TAGGCAGGAA G 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GTCACAGAGG GAACCTCC 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 
GCTCCTGAGT GAGGCTGAAA TCA 
(2) INFORMATION FOR SEQ ID NO: 72: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GAGATGCTGG ATCTACCATC TGC 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
CTGAGCTGGG AGATTTTTAT GG 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GTGGATCAGC ACTGAAATCT G 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGTTTGAAGA AGCCAAGCTT G 
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(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CACAGCGGAG GTGCAGGCAG 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single. - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CTCACTGCTT GCGCTGGC 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CGGTAAGATA GCTCTGCTGG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
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CAGCCCACAG CCAGCACAGG 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GATCCAACGC CAGATCATAC C 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 baoe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 
CACGGCCAGG TCCACCAGGC 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CACGTCCCCT AGCACTGTCA G 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single * 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
TTGACGAAGT CCTTCATCTG GG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GAACTGCAAG CTGGAGCCCA G 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 
CTCGATCCTG CCAAGTGCTA C 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:86: 
GCCTTGGAGC TGGACGATGG C 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 : 
GTAAGATCTC CAGAGTGTCC AAGACAAGAG ATG 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
CTTCTCGAGT GTGAGAGCTG AACTGA&ACC TTC 
(2) INFORMATION FOR SEQ ID NO:: 89s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
CGCTGTGACG TCAGAGTTGA GTCCAAATAT GG 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
GGTGACACTA TAGAATAGGG C 

(2) INFORMATION FOR SEQ ID NO: 91: ■ . '. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 



4 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

AAGCAGGAGCTCCTGTGT 1 8 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 852 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME/KEY: CDS : 

( B) LOCATION: 61* ,852 

(xi) SEQUENCE DESCRIPTION: SEQ ' ID NO: 92: 

TGATCTCCCT CCAGGCCACT GTTCCCTCTC CACTTCCCCT CACCGCTGCA CTG CTC AG AG 60 

ATG GCC CTT GGG GCT GTG GTC CTC CTT GGG GTC CTG GCT TCT TAC CAC 108 
Met Ala Leu Gly Ala Val Val Leu Leu Gly Val Leu Ala Ser Tyr His 
1 5 10 15 

GGA TTC AAC TTG GAC GTG ATG AGC GGT GAT CTT CCA GGA AGA CGC AGC 156 
Gly Phe Asn Leu Asp Val Met Ser Gly . Asp Leu Pro Gly Arg Arg Ser 
20 25 30 

GGG CTT CGG GCA GAG CGT GAT GCA GTT TGG GGA TCT CGA CTC GTG GTG 204 
Gly Leu Arg Ala Glu Arg Asp Ala Val Trp Gly Ser Arg Leu Val Val 
35 40 45 

GGA GCC CCC CTG GCG GTG GTG TCG GCC AAC CAC ACA GGA CGG CTG TAC 252 
Gly Ala Pro Leu Ala Val Val Ser Ala Asn His Thr Gly Arg Leu Tyr 
50 55 60 

GAG TGT GCG CCT GCC TCC GGC ACC TGC ACG CCC ATT TTC CCA TTC ATG 300 
Glu Cys Ala Pro Ala Ser Gly Thr Cys Thr Pro lie Phe Pro Phe Met 
65 70 75 80 

CCC CCC GAA GCC GTG AAC ATG TCC CTG GGC CTG TCC CTG GCA GCC TCC 348 
Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu .Ala Ala Ser 
85 90 95 

CCC AAC CAT TCC CAG CTG CTG GCT TGT GGC CCG ACC GTG CAT AGA GCC 396 
Pro Asn His Ser Gin Leu Leu Ala Cys Gly Pro Thr Val His Arg Ala 
100 . 105 - 110 
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TGC GGG GAG GAC GTG TAG GCC CAG GGT TTC TGT GTG CTG CTG GAT GCC 444 

Cye Gly Glu Asp Val Tyr Ala Gin Gly Phe Cys Val Leu Leu Asp Ala 

115 120 125 

CAC GCA CAG CCC ATC GGG ACT CTG CCA GCT GCC CTG CCC GAG TGC CCA 492 
His Ala Gin Pro lie Gly Thr Val Pro Ala Ala Leu Pro Glu CyB Pro 
130 135 140 

GAT CAA GAG ATG GAC ATT <GTC TTC CTG ATT GAC GGC TCT GGC AGC ATT 540 
Asp Gin Glu Met Asp lie Val Phe Leu lie Asp Gly Ser Gly Ser lie 
145 150 155 160 

AGC TCA AAT GAC TTC CGC AAG ATG AAG GAC TTT GTC AGA GCT GTG ATG 588 
Ser Ser Asn Asp Phe Arg Lys Met Lys Asp Phe Val Arg Ala Val Met 
165 170 175 

GAC CAG TTC AAG GAC ACC AAC ACC CAG TTC TCG CTG ATG CAG TAC TCC 636 
Asp Gin Phe Lys Asp Thr Asn Thr Gin Phe Ser Leu Met Gin Tyr Ser 
180 . 185 190 

AAT GTG CTG GTG ACA CAT TTC ACC TTC AGC AGC TTC CGG AAC AGC TCC 684 
Asn Val Leu Val Thr His Phe Thr Phe Ser Ser Phe Arg Asn Ser Ser 
195 200 205 

AAT CCT CAG GGC CTA GTG GAG CCC ATT GTG CAG CTG ACA GGC CTC ACG 732 
Asn Pro Gin Gly Leu Val Glu Pro lie Val Gin Leu Thr Gly Leu Thr 
210 215 220 

TTC ACG GCC ACA GGG ATC CTG AAA GTG GTG ACA GAG CTG TTT CAA ACC 780 
Phe Thr Ala Thr Gly lie Leu Lys Val Val Thr Glu Leu Phe Gin Thr 
225 230 235 240 

AAG AAC GGG GCC CGC GAA AGT GCC AAG AAG ATC CTC ATC GTC ATC ACA 828 
Lys Asn Gly Ala Arg Glu Ser Ala Lys Lyo lie Leu lie Val lie Thr 
245 250 255 

GAT GGG CAG AAG TAC AAA GCG GCA 852 
Asp Gly Gin Lys Tyr Lys Ala Ala 
260 

(2) INFORMATION FOR SEQ ID NO: 93: 

<i) SEQUENCE CHARACTERISTICS: 

, (A) LENGTH: 264 amino acids 
(B) TYPE: amine acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

Met Ala Leu Gly Ala Val Val Leu Leu Gly Val Leu Ala Ser Tyr His 
1 5 .10 15 

Gly Phe Asn Leu Asp Val Met Ser Gly Asp Leu Pro Gly Arg Arg Ser 

20 ... 25 30 

Gly Leu Arg Ala Glu Arg Asp Ala Val Trp Gly Ser Arg Leu Val Val 
35 40 45 
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Gly Ala Pro Leu Ala Val Val Ser Ala Asn His Thr Gly Arq Leu Tvr 
50 55 60 

Glu Cys Ala Pro Ala Ser Gly Thr Cys Thr Pro He Phe Pro Phe Met 
65 70 75 80 

Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Ala Ala Ser 
85 90 95 

Pro Aon Hie Ser Gin Leu Leu Ala Cye Gly Pro Thr Val Hie Arg Ala 
100 105 HO 

Cys Gly Glu Asp Val Tyr Ala Gin Gly Phe Cys Val Leu Leu Asp Ala 
115 120 125 

His Ala Gin Pro He Gly Thr Val Pro Ala Ala Leu Pro Glu Cys Pro 
130 135 140 

Asp Gin Glu Met Asp He Val Phe Leu He Asp Gly Ser Gly Ser He 
14 5 150 155 160 

Ser Ser Asn Asp Phe Arg Lys Met Lys Asp Phe Val Arg Ala Val Met 
165 170 175 

Asp Gin Phe Lys Asp Thr Asn Thr Gin Phe Ser Leu Met Gin Tyr Ser 

180 . .. 185 -j 190 

Asn Val Leu Val Thr His Phe Thr Phe Ser Ser Phe Arg Asn Ser Ser 
195 200 205 

Asn Pro Gin Cly Lou Val Glu Pro He Val Gin Leu Thr Gly Leu Thr 
210 215 220 

Phe Thr Ala Thr Cly lie Leu Lys Val Val Thr Glu Leu Phe Gin Thr 
225 230 235 240 

Lys Asn Gly Ala Arg Glu Ser Ala Lys Lys He Leu He Val He Thr 
245 250 255 

Asp Gly Gin Lys Tyr Lys Ala Ala 
260 
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WHAT IS CLAIMED IS: 

1. A purified and isolated a d polynucleotide consisting essentially 
of human <* d protein coding sequence set out in SEQ ID NO: 1. 

2. The polynucleotide of claim 1 which is a DNA molecule. 

3. The DNA molecule of claim 2 which is a cDNA molecule. 

4. The DNA molecule of claim 2 which is a genomic DNA 

molecule. 

5. The DNA molecule of claim 2 which is a wholly or partially 
chemically synthesized DNA molecule. 

6. A full length purified and isolated a d -encoding polynucleotide 
selected from the group consisting of: 

a) the human DNA sequence set out in SEQ ID NO: 1, and 

b) a DNA molecule which hybridizes under stringent conditions 
to the noncoding strand of the protein coding portion of the DNA of a). 

7. A DNA molecule encoding the human a d amino acid sequence 
set out in SEQ ID NO: 2. 

8. A DNA expression construct comprising a DNA molecule 
according to claim 2. 

9. A host cell transformed with a DNA molecule according to 

claim 2. 
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10. A method for producing an a d polypeptide comprising growing 
a host cell according to claim 9 in a suitable medium and isolating a d polypeptide 



from said host cell or the medium of its growth. 

11. Purified and isolated or d polypeptide consisting essentially of 
the human a d amino acid sequence set out in SEQ ID NO: 2. 



14. An antibody according to claim 13 which is a monoclonal 

antibody. 

15. An anti-idibtype antibody specific for the monoclonal antibody 

of claim 14. 

16. A hybridoma cell iirie producing the monoclonal antibody 
according to claim 14. 

17. A purified and isolated a d extracellular domain polypeptide 
fragment comprising amino acids 17 to 1108 of the human a d amino acid 
sequence set out in SEQ ID NO: 2. 

18. A purified and isolated a d I domain polypeptide fragment 
comprising amino acids 145 to 355 of the human a d amino acid sequence set out 
in SEQ ID NO: 2. 



12. A polypeptide capable of specifically binding to a d . 



13. A polypeptide according to claim 12 which is an antibody. 
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19. A fusion protein comprising a d extracellular domain 
polypeptide amino acids 17 to 1 108 of SEQ ID NO: 2 and human immunoglobulin 
constant domain sequences. 

20. A purified and isolated murine polynucleotide consisting 
essentially of the a subunit protein coding sequence set out in SEQ ID NO: 45. 

2 1 . A method for isolating a polynucleotide encoding a protein that 
binds to a d comprising the steps of: 

a) transforming or transfecting appropriate host cells with a DNA 
construct comprising a reporter gene under the control of a promoter regulated by 
a transcription factor having a DNA-binding domain and an activating domain; 

b) expressing in said host cells a first hybrid DNA sequence 
encoding a first fusion of part or all of a d and either the DNA binding domain or 
the activating domain of said transcription factor, 

c) expressing in said host cells a library of second hybrid DNA 
sequences encoding second fusions of part or all of putative a d binding proteins 
and the DNA binding domain or activating domain of said transcription factor 
which is not incorporated in said first fusion; 

d) detecting binding of an ot d binding protein to a d in a particular 
host cell by detecting the production of reporter gene product in said host cell; 
and 

e) isolating second hybrid DNA sequences encoding a d binding 
protein from said particular host cell. 

22. A method for identifying a compound capable of reacting 
specifically with a d and of modulating the interaction of binding partners a d and 
ICAM-R comprising the steps of: 
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a) immobilizing a d or a fragment thereof, or ICAM-R or a 
fragment thereof, on a solid support coated or impregnated with a fluorescent 
agent; 

b) labelling the non-immobilized binding partner with a compound 
capable of exciting said fluorescent agent; 

c) contacting said immobilized binding partner with said labelled 
binding partner in the presence and absence of a putative modulator compound 
capable of specifically reacting with a d ; 

d) detecting light emission by said fluorescent agent; and 

e) identifying modulating compounds as those compounds that affect 
the emission of light by said fluorescent agent in comparison to the emission of 
light by said fluorescent agent in the absence of said modulating compound. 

23. A purified and isolated a d extracellular domain polypeptide 
fragment comprising about amino acid 127 to about amino acid 353 of the human 
a d amino acid sequence set out in SEQ ID NO: 2. 

24. A fusion protein comprising the polypeptide fragment of claim 
23 and human immunoglobulin constant region sequences. 

25. A purified and isolated ot d extracellular domain polypeptide 
fragment comprising about amino acid 17 to about amino acid 603 of the human 
a d amino acid sequence set out in SEQ ID NO: 2. 

26. A fusion protein comprising the polypeptide fragment of claim 
25 and human immunoglobulin constant region sequences. 
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27. A purified and isolated <* d extracellular domain polypeptide 
fragment comprising about amino acid 17 to about about amino acid 1029 of the 
human ar d amino acid sequence set out in SEQ ID NO: 2. 

28. A fusion protein comprising the polypeptide fragment of claim 
27 and immunoglobulin constant region sequences. 

29. A purified and isolated murine polynucleotide comprising the 
a subunit protein coding sequence as set out in SEQ ID NO: 52. 

30. A purified and isolated a d polypeptide consisting essentially 
of the murine a d amino acid sequence set out in SEQ ID NO: 53. 

31. .A purified and isolated rat polynucleotide comprising the a 
subunit protein coding sequence as set out in SEQ ED NO: 54. 

32. A purified and isolated a d polypeptide consisting essentially 
of the rat a d amino acid sequence set out in SEQ ID NO: 55. 

33. A purified and isolated polypeptide fragment comprising 
extracellular domain sequences of the polypeptide of claim 32. 

34. A polypeptide capable of specifically binding to the 
polypeptide of claim 32. 

35. A polypeptide according to claim 34 which is an antibody. 

36. An antibody according to claim 35 which is a monoclonal 

antibody. 
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37. A rodent that does not express a functional a d protein. 

c 

38. A rodent that expresses a variant protein. 
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